Overview
Brought to you by YData
Dataset statistics
| Number of variables | 60 |
|---|---|
| Number of observations | 584201 |
| Missing cells | 12827277 |
| Missing cells (%) | 36.6% |
| Total size in memory | 267.4 MiB |
| Average record size in memory | 480.0 B |
Variable types
| Text | 60 |
|---|
Dataset
| Description | Herpetology NMNH Extant Specimen Records 0054921-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.rf2che |
institutionID has constant value "urn:lsid:biocol.org:col:34871" | Constant |
collectionID has constant value "urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0" | Constant |
institutionCode has constant value "USNM" | Constant |
collectionCode has constant value "HERP" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
kingdom has constant value "Animalia" | Constant |
phylum has constant value "Chordata" | Constant |
taxonRank has constant value "subspecies" | Constant |
recordNumber has 583925 (> 99.9%) missing values | Missing |
sex has 527948 (90.4%) missing values | Missing |
lifeStage has 539845 (92.4%) missing values | Missing |
associatedMedia has 579054 (99.1%) missing values | Missing |
associatedSequences has 583480 (99.9%) missing values | Missing |
occurrenceRemarks has 557618 (95.4%) missing values | Missing |
fieldNumber has 584193 (> 99.9%) missing values | Missing |
eventDate has 37781 (6.5%) missing values | Missing |
startDayOfYear has 55728 (9.5%) missing values | Missing |
endDayOfYear has 55637 (9.5%) missing values | Missing |
year has 37781 (6.5%) missing values | Missing |
month has 54300 (9.3%) missing values | Missing |
day has 85891 (14.7%) missing values | Missing |
waterBody has 555994 (95.2%) missing values | Missing |
islandGroup has 564324 (96.6%) missing values | Missing |
island has 576136 (98.6%) missing values | Missing |
stateProvince has 17001 (2.9%) missing values | Missing |
county has 191557 (32.8%) missing values | Missing |
minimumElevationInMeters has 332173 (56.9%) missing values | Missing |
maximumElevationInMeters has 333225 (57.0%) missing values | Missing |
verbatimElevation has 331608 (56.8%) missing values | Missing |
decimalLatitude has 162901 (27.9%) missing values | Missing |
decimalLongitude has 162901 (27.9%) missing values | Missing |
geodeticDatum has 438700 (75.1%) missing values | Missing |
coordinateUncertaintyInMeters has 439218 (75.2%) missing values | Missing |
verbatimLatitude has 334540 (57.3%) missing values | Missing |
verbatimLongitude has 334562 (57.3%) missing values | Missing |
georeferenceProtocol has 439136 (75.2%) missing values | Missing |
georeferenceRemarks has 443625 (75.9%) missing values | Missing |
identificationQualifier has 583784 (99.9%) missing values | Missing |
typeStatus has 570681 (97.7%) missing values | Missing |
identifiedBy has 584125 (> 99.9%) missing values | Missing |
specificEpithet has 13122 (2.2%) missing values | Missing |
infraspecificEpithet has 556206 (95.2%) missing values | Missing |
taxonRank has 556206 (95.2%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
catalogNumber has unique values | Unique |
Reproduction
| Analysis started | 2025-01-14 16:50:03.383103 |
|---|---|
| Analysis finished | 2025-01-14 16:50:17.330904 |
| Duration | 13.95 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 584201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 584201 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1317203362 |
|---|---|
| 2nd row | 1317203927 |
| 3rd row | 1317204107 |
| 4th row | 1322537851 |
| 5th row | 1322539748 |
| Value | Count | Frequency (%) |
| 1317203362 | 1 | < 0.1% |
| 1322539748 | 1 | < 0.1% |
| 1322560470 | 1 | < 0.1% |
| 1322558547 | 1 | < 0.1% |
| 1317274722 | 1 | < 0.1% |
| 1317214758 | 1 | < 0.1% |
| 1317204107 | 1 | < 0.1% |
| 1322537851 | 1 | < 0.1% |
| 1317211425 | 1 | < 0.1% |
| 1322569185 | 1 | < 0.1% |
| Other values (584191) | 584191 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1289572 | |
| 3 | 931906 | |
| 2 | 745858 | |
| 8 | 464209 | 7.9% |
| 9 | 461174 | 7.9% |
| 0 | 439271 | 7.5% |
| 7 | 430436 | 7.4% |
| 4 | 371688 | 6.4% |
| 5 | 355028 | 6.1% |
| 6 | 352868 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5842010 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1289572 | |
| 3 | 931906 | |
| 2 | 745858 | |
| 8 | 464209 | 7.9% |
| 9 | 461174 | 7.9% |
| 0 | 439271 | 7.5% |
| 7 | 430436 | 7.4% |
| 4 | 371688 | 6.4% |
| 5 | 355028 | 6.1% |
| 6 | 352868 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5842010 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1289572 | |
| 3 | 931906 | |
| 2 | 745858 | |
| 8 | 464209 | 7.9% |
| 9 | 461174 | 7.9% |
| 0 | 439271 | 7.5% |
| 7 | 430436 | 7.4% |
| 4 | 371688 | 6.4% |
| 5 | 355028 | 6.1% |
| 6 | 352868 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5842010 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1289572 | |
| 3 | 931906 | |
| 2 | 745858 | |
| 8 | 464209 | 7.9% |
| 9 | 461174 | 7.9% |
| 0 | 439271 | 7.5% |
| 7 | 430436 | 7.4% |
| 4 | 371688 | 6.4% |
| 5 | 355028 | 6.1% |
| 6 | 352868 | 6.0% |
modified
Text
| Distinct | 11116 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 6239 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 2022-03-25 16:29:00 |
|---|---|
| 2nd row | 2022-12-14 12:20:00 |
| 3rd row | 2022-07-25 13:54:00 |
| 4th row | 2022-03-25 16:12:00 |
| 5th row | 2022-03-25 16:41:00 |
| Value | Count | Frequency (%) |
| 2022-08-17 | 164186 | 14.1% |
| 2022-03-25 | 159648 | 13.7% |
| 2018-10-02 | 114364 | 9.8% |
| 2018-10-01 | 11147 | 1.0% |
| 2022-09-02 | 9885 | 0.8% |
| 2024-09-04 | 7897 | 0.7% |
| 2022-12-02 | 7119 | 0.6% |
| 2014-08-26 | 6094 | 0.5% |
| 2020-09-23 | 6045 | 0.5% |
| 2014-08-28 | 5728 | 0.5% |
| Other values (2040) | 676289 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2825947 | |
| 2 | 1945269 | |
| 1 | 1362306 | |
| - | 1168402 | |
| : | 1168402 | |
| 584201 | 5.3% | |
| 8 | 454189 | 4.1% |
| 5 | 397958 | 3.6% |
| 3 | 369937 | 3.3% |
| 7 | 249238 | 2.2% |
| Other values (3) | 573970 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8178814 | |
| Dash Punctuation | 1168402 | 10.5% |
| Other Punctuation | 1168402 | 10.5% |
| Space Separator | 584201 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2825947 | |
| 2 | 1945269 | |
| 1 | 1362306 | |
| 8 | 454189 | 5.6% |
| 5 | 397958 | 4.9% |
| 3 | 369937 | 4.5% |
| 7 | 249238 | 3.0% |
| 4 | 229171 | 2.8% |
| 6 | 174007 | 2.1% |
| 9 | 170792 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1168402 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1168402 |
Space Separator
| Value | Count | Frequency (%) |
| 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11099819 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2825947 | |
| 2 | 1945269 | |
| 1 | 1362306 | |
| - | 1168402 | |
| : | 1168402 | |
| 584201 | 5.3% | |
| 8 | 454189 | 4.1% |
| 5 | 397958 | 3.6% |
| 3 | 369937 | 3.3% |
| 7 | 249238 | 2.2% |
| Other values (3) | 573970 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11099819 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2825947 | |
| 2 | 1945269 | |
| 1 | 1362306 | |
| - | 1168402 | |
| : | 1168402 | |
| 584201 | 5.3% | |
| 8 | 454189 | 4.1% |
| 5 | 397958 | 3.6% |
| 3 | 369937 | 3.3% |
| 7 | 249238 | 2.2% |
| Other values (3) | 573970 | 5.2% |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:34871 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2336804 | |
| : | 2336804 | |
| l | 1752603 | 10.3% |
| i | 1168402 | 6.9% |
| r | 1168402 | 6.9% |
| c | 1168402 | 6.9% |
| g | 584201 | 3.4% |
| 7 | 584201 | 3.4% |
| 8 | 584201 | 3.4% |
| 4 | 584201 | 3.4% |
| Other values (8) | 4673608 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11099819 | |
| Other Punctuation | 2921005 | 17.2% |
| Decimal Number | 2921005 | 17.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2336804 | |
| l | 1752603 | |
| i | 1168402 | |
| r | 1168402 | |
| c | 1168402 | |
| g | 584201 | 5.3% |
| u | 584201 | 5.3% |
| b | 584201 | 5.3% |
| d | 584201 | 5.3% |
| s | 584201 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 584201 | |
| 8 | 584201 | |
| 4 | 584201 | |
| 3 | 584201 | |
| 1 | 584201 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2336804 | |
| . | 584201 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11099819 | |
| Common | 5842010 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2336804 | |
| l | 1752603 | |
| i | 1168402 | |
| r | 1168402 | |
| c | 1168402 | |
| g | 584201 | 5.3% |
| u | 584201 | 5.3% |
| b | 584201 | 5.3% |
| d | 584201 | 5.3% |
| s | 584201 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 2336804 | |
| 7 | 584201 | 10.0% |
| 8 | 584201 | 10.0% |
| 4 | 584201 | 10.0% |
| 3 | 584201 | 10.0% |
| . | 584201 | 10.0% |
| 1 | 584201 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16941829 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2336804 | |
| : | 2336804 | |
| l | 1752603 | 10.3% |
| i | 1168402 | 6.9% |
| r | 1168402 | 6.9% |
| c | 1168402 | 6.9% |
| g | 584201 | 3.4% |
| 7 | 584201 | 3.4% |
| 8 | 584201 | 3.4% |
| 4 | 584201 | 3.4% |
| Other values (8) | 4673608 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
|---|---|
| 2nd row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
| 3rd row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
| 4th row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
| 5th row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
| Value | Count | Frequency (%) |
| urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2921005 | 11.1% |
| - | 2336804 | 8.9% |
| u | 1752603 | 6.7% |
| c | 1752603 | 6.7% |
| 7 | 1752603 | 6.7% |
| 0 | 1752603 | 6.7% |
| b | 1752603 | 6.7% |
| d | 1752603 | 6.7% |
| 4 | 1168402 | 4.4% |
| f | 1168402 | 4.4% |
| Other values (10) | 8178814 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11684020 | |
| Decimal Number | 11099819 | |
| Dash Punctuation | 2336804 | 8.9% |
| Other Punctuation | 1168402 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 1752603 | |
| c | 1752603 | |
| b | 1752603 | |
| d | 1752603 | |
| f | 1168402 | |
| a | 1168402 | |
| i | 584201 | 5.0% |
| r | 584201 | 5.0% |
| e | 584201 | 5.0% |
| n | 584201 | 5.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2921005 | |
| 7 | 1752603 | |
| 0 | 1752603 | |
| 4 | 1168402 | 10.5% |
| 8 | 1168402 | 10.5% |
| 3 | 1168402 | 10.5% |
| 9 | 584201 | 5.3% |
| 6 | 584201 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2336804 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1168402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14605025 | |
| Latin | 11684020 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2921005 | |
| - | 2336804 | |
| 7 | 1752603 | |
| 0 | 1752603 | |
| 4 | 1168402 | 8.0% |
| : | 1168402 | 8.0% |
| 8 | 1168402 | 8.0% |
| 3 | 1168402 | 8.0% |
| 9 | 584201 | 4.0% |
| 6 | 584201 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| u | 1752603 | |
| c | 1752603 | |
| b | 1752603 | |
| d | 1752603 | |
| f | 1168402 | |
| a | 1168402 | |
| i | 584201 | 5.0% |
| r | 584201 | 5.0% |
| e | 584201 | 5.0% |
| n | 584201 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26289045 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2921005 | 11.1% |
| - | 2336804 | 8.9% |
| u | 1752603 | 6.7% |
| c | 1752603 | 6.7% |
| 7 | 1752603 | 6.7% |
| 0 | 1752603 | 6.7% |
| b | 1752603 | 6.7% |
| d | 1752603 | 6.7% |
| 4 | 1168402 | 4.4% |
| f | 1168402 | 4.4% |
| Other values (10) | 8178814 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 | |
| N | 584201 | |
| M | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2336804 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 | |
| N | 584201 | |
| M | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2336804 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 | |
| N | 584201 | |
| M | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2336804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 584201 | |
| S | 584201 | |
| N | 584201 | |
| M | 584201 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HERP |
|---|---|
| 2nd row | HERP |
| 3rd row | HERP |
| 4th row | HERP |
| 5th row | HERP |
| Value | Count | Frequency (%) |
| herp | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 584201 | |
| E | 584201 | |
| R | 584201 | |
| P | 584201 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2336804 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 584201 | |
| E | 584201 | |
| R | 584201 | |
| P | 584201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2336804 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 584201 | |
| E | 584201 | |
| R | 584201 | |
| P | 584201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2336804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 584201 | |
| E | 584201 | |
| R | 584201 | |
| P | 584201 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 584201 | |
| extant | 584201 | |
| biology | 584201 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1168402 | 10.5% |
| 1168402 | 10.5% | |
| t | 1168402 | 10.5% |
| o | 1168402 | 10.5% |
| M | 584201 | 5.3% |
| H | 584201 | 5.3% |
| E | 584201 | 5.3% |
| x | 584201 | 5.3% |
| a | 584201 | 5.3% |
| n | 584201 | 5.3% |
| Other values (5) | 2921005 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6426211 | |
| Uppercase Letter | 3505206 | |
| Space Separator | 1168402 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1168402 | |
| o | 1168402 | |
| x | 584201 | |
| a | 584201 | |
| n | 584201 | |
| i | 584201 | |
| l | 584201 | |
| g | 584201 | |
| y | 584201 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1168402 | |
| M | 584201 | |
| H | 584201 | |
| E | 584201 | |
| B | 584201 |
Space Separator
| Value | Count | Frequency (%) |
| 1168402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9931417 | |
| Common | 1168402 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1168402 | |
| t | 1168402 | |
| o | 1168402 | |
| M | 584201 | 5.9% |
| H | 584201 | 5.9% |
| E | 584201 | 5.9% |
| x | 584201 | 5.9% |
| a | 584201 | 5.9% |
| n | 584201 | 5.9% |
| B | 584201 | 5.9% |
| Other values (4) | 2336804 |
Common
| Value | Count | Frequency (%) |
| 1168402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11099819 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1168402 | 10.5% |
| 1168402 | 10.5% | |
| t | 1168402 | 10.5% |
| o | 1168402 | 10.5% |
| M | 584201 | 5.3% |
| H | 584201 | 5.3% |
| E | 584201 | 5.3% |
| x | 584201 | 5.3% |
| a | 584201 | 5.3% |
| n | 584201 | 5.3% |
| Other values (5) | 2921005 |
basisOfRecord
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 17.00021739 |
| Min length | 17 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PreservedSpecimen |
|---|---|
| 2nd row | PreservedSpecimen |
| 3rd row | PreservedSpecimen |
| 4th row | PreservedSpecimen |
| 5th row | PreservedSpecimen |
| Value | Count | Frequency (%) |
| preservedspecimen | 584074 | |
| machineobservation | 127 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2920624 | |
| r | 1168275 | |
| i | 584328 | 5.9% |
| n | 584328 | 5.9% |
| c | 584201 | 5.9% |
| s | 584201 | 5.9% |
| v | 584201 | 5.9% |
| m | 584074 | 5.9% |
| P | 584074 | 5.9% |
| p | 584074 | 5.9% |
| Other values (9) | 1169164 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8763142 | |
| Uppercase Letter | 1168402 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2920624 | |
| r | 1168275 | |
| i | 584328 | 6.7% |
| n | 584328 | 6.7% |
| c | 584201 | 6.7% |
| s | 584201 | 6.7% |
| v | 584201 | 6.7% |
| m | 584074 | 6.7% |
| p | 584074 | 6.7% |
| d | 584074 | 6.7% |
| Other values (5) | 762 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 584074 | |
| S | 584074 | |
| M | 127 | < 0.1% |
| O | 127 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9931544 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2920624 | |
| r | 1168275 | |
| i | 584328 | 5.9% |
| n | 584328 | 5.9% |
| c | 584201 | 5.9% |
| s | 584201 | 5.9% |
| v | 584201 | 5.9% |
| m | 584074 | 5.9% |
| P | 584074 | 5.9% |
| p | 584074 | 5.9% |
| Other values (9) | 1169164 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9931544 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2920624 | |
| r | 1168275 | |
| i | 584328 | 5.9% |
| n | 584328 | 5.9% |
| c | 584201 | 5.9% |
| s | 584201 | 5.9% |
| v | 584201 | 5.9% |
| m | 584074 | 5.9% |
| P | 584074 | 5.9% |
| p | 584074 | 5.9% |
| Other values (9) | 1169164 |
occurrenceID
Text
Unique 
| Distinct | 584201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 584201 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/3000ac9b1-ec0b-4be2-939f-464ad355cc84 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/30010adfb-58e1-4e98-8d39-ee055b3463fa |
| 3rd row | http://n2t.net/ark:/65665/30012ab17-d2a1-470c-a774-540bc6cffb00 |
| 4th row | http://n2t.net/ark:/65665/3ec02d332-deb7-4b55-ba3d-5a5d6ca577c9 |
| 5th row | http://n2t.net/ark:/65665/3ec19a125-2484-4fa3-b6b7-7d87199a6994 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/3000ac9b1-ec0b-4be2-939f-464ad355cc84 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec19a125-2484-4fa3-b6b7-7d87199a6994 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ed02751f-656c-458c-80fa-90bf891a2063 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3eced04ac-39a4-455a-85e7-7cb0b4299f6b | 1 | < 0.1% |
| http://n2t.net/ark:/65665/303348f04-82b4-456c-be8d-764af3205229 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3008b1b21-05b1-4e8d-b34c-1e3a96daecf7 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30012ab17-d2a1-470c-a774-540bc6cffb00 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec02d332-deb7-4b55-ba3d-5a5d6ca577c9 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3006575b6-ca0a-42bd-b75d-3241cc3e332d | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ed66e63b-4fff-4639-8abf-a635d31dd047 | 1 | < 0.1% |
| Other values (584191) | 584191 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 2921005 | 7.9% |
| 6 | 2847614 | 7.7% |
| - | 2336804 | 6.3% |
| t | 2336804 | 6.3% |
| 5 | 2265995 | 6.2% |
| a | 1826256 | 5.0% |
| e | 1681096 | 4.6% |
| 2 | 1680524 | 4.6% |
| 3 | 1680017 | 4.6% |
| 4 | 1678083 | 4.6% |
| Other values (16) | 15550465 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15919515 | |
| Lowercase Letter | 13874736 | |
| Other Punctuation | 4673608 | 12.7% |
| Dash Punctuation | 2336804 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2336804 | |
| a | 1826256 | |
| e | 1681096 | |
| b | 1241364 | |
| n | 1168402 | |
| c | 1094913 | |
| f | 1094889 | |
| d | 1094208 | |
| k | 584201 | 4.2% |
| r | 584201 | 4.2% |
| Other values (2) | 1168402 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2847614 | |
| 5 | 2265995 | |
| 2 | 1680524 | |
| 3 | 1680017 | |
| 4 | 1678083 | |
| 9 | 1244007 | |
| 8 | 1240305 | |
| 1 | 1096638 | 6.9% |
| 7 | 1094431 | 6.9% |
| 0 | 1091901 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2921005 | |
| : | 1168402 | 25.0% |
| . | 584201 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2336804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22929927 | |
| Latin | 13874736 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 2921005 | |
| 6 | 2847614 | |
| - | 2336804 | |
| 5 | 2265995 | |
| 2 | 1680524 | |
| 3 | 1680017 | |
| 4 | 1678083 | |
| 9 | 1244007 | 5.4% |
| 8 | 1240305 | 5.4% |
| : | 1168402 | 5.1% |
| Other values (4) | 3867171 |
Latin
| Value | Count | Frequency (%) |
| t | 2336804 | |
| a | 1826256 | |
| e | 1681096 | |
| b | 1241364 | |
| n | 1168402 | |
| c | 1094913 | |
| f | 1094889 | |
| d | 1094208 | |
| k | 584201 | 4.2% |
| r | 584201 | 4.2% |
| Other values (2) | 1168402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36804663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 2921005 | 7.9% |
| 6 | 2847614 | 7.7% |
| - | 2336804 | 6.3% |
| t | 2336804 | 6.3% |
| 5 | 2265995 | 6.2% |
| a | 1826256 | 5.0% |
| e | 1681096 | 4.6% |
| 2 | 1680524 | 4.6% |
| 3 | 1680017 | 4.6% |
| 4 | 1678083 | 4.6% |
| Other values (16) | 15550465 |
catalogNumber
Text
Unique 
| Distinct | 584201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 11 |
| Mean length | 10.93256944 |
| Min length | 6 |
Unique
| Unique | 584201 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | USNM 231889 |
|---|---|
| 2nd row | USNM 487703 |
| 3rd row | USNM 297347 |
| 4th row | USNM 322261 |
| 5th row | USNM 319170 |
| Value | Count | Frequency (%) |
| usnm | 584201 | |
| herp | 5833 | 0.5% |
| tissue | 5706 | 0.5% |
| image | 127 | < 0.1% |
| 2847 | 3 | < 0.1% |
| 2877 | 3 | < 0.1% |
| 2872 | 3 | < 0.1% |
| 2940 | 3 | < 0.1% |
| 2715 | 3 | < 0.1% |
| 9 | 3 | < 0.1% |
| Other values (581072) | 584183 |
Most occurring characters
| Value | Count | Frequency (%) |
| 595867 | 9.3% | |
| U | 584201 | 9.1% |
| N | 584201 | 9.1% |
| M | 584201 | 9.1% |
| S | 584201 | 9.1% |
| 4 | 393545 | 6.2% |
| 2 | 393142 | 6.2% |
| 3 | 392798 | 6.2% |
| 1 | 391284 | 6.1% |
| 5 | 383581 | 6.0% |
| Other values (17) | 1499797 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3395944 | |
| Uppercase Letter | 2348470 | |
| Space Separator | 595867 | 9.3% |
| Lowercase Letter | 46537 | 0.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 393545 | |
| 2 | 393142 | |
| 3 | 392798 | |
| 1 | 391284 | |
| 5 | 383581 | |
| 6 | 292686 | |
| 7 | 291064 | |
| 8 | 290326 | |
| 9 | 285200 | |
| 0 | 282318 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11666 | |
| s | 11412 | |
| r | 5833 | |
| p | 5833 | |
| i | 5706 | |
| u | 5706 | |
| m | 127 | 0.3% |
| a | 127 | 0.3% |
| g | 127 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 584201 | |
| N | 584201 | |
| M | 584201 | |
| S | 584201 | |
| H | 5833 | 0.2% |
| T | 5706 | 0.2% |
| I | 127 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 595867 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3991811 | |
| Latin | 2395007 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 584201 | |
| N | 584201 | |
| M | 584201 | |
| S | 584201 | |
| e | 11666 | 0.5% |
| s | 11412 | 0.5% |
| H | 5833 | 0.2% |
| r | 5833 | 0.2% |
| p | 5833 | 0.2% |
| T | 5706 | 0.2% |
| Other values (6) | 11920 | 0.5% |
Common
| Value | Count | Frequency (%) |
| 595867 | ||
| 4 | 393545 | |
| 2 | 393142 | |
| 3 | 392798 | |
| 1 | 391284 | |
| 5 | 383581 | |
| 6 | 292686 | |
| 7 | 291064 | |
| 8 | 290326 | |
| 9 | 285200 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6386818 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 595867 | 9.3% | |
| U | 584201 | 9.1% |
| N | 584201 | 9.1% |
| M | 584201 | 9.1% |
| S | 584201 | 9.1% |
| 4 | 393545 | 6.2% |
| 2 | 393142 | 6.2% |
| 3 | 392798 | 6.2% |
| 1 | 391284 | 6.1% |
| 5 | 383581 | 6.0% |
| Other values (17) | 1499797 |
recordNumber
Text
Missing 
| Distinct | 273 |
|---|---|
| Distinct (%) | 98.9% |
| Missing | 583925 |
| Missing (%) | > 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.460144928 |
| Min length | 1 |
Unique
| Unique | 271 ? |
|---|---|
| Unique (%) | 98.2% |
Sample
| 1st row | RWM 20004 |
|---|---|
| 2nd row | RWM 19953 |
| 3rd row | RWM 19978 |
| 4th row | RWM 19932 |
| 5th row | RWM 19955 |
| Value | Count | Frequency (%) |
| rwm | 182 | |
| gmu | 74 | 13.5% |
| lc | 15 | 2.7% |
| 8 | 3 | 0.5% |
| 19897 | 2 | 0.4% |
| 19895 | 1 | 0.2% |
| 19926 | 1 | 0.2% |
| 2430 | 1 | 0.2% |
| 19973 | 1 | 0.2% |
| 19925 | 1 | 0.2% |
| Other values (267) | 267 |
Most occurring characters
| Value | Count | Frequency (%) |
| 272 | ||
| 9 | 260 | |
| M | 257 | |
| 0 | 245 | |
| 1 | 190 | |
| W | 182 | |
| R | 182 | |
| 2 | 165 | |
| 3 | 95 | 4.1% |
| G | 75 | 3.2% |
| Other values (9) | 412 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1262 | |
| Uppercase Letter | 801 | |
| Space Separator | 272 | 11.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 260 | |
| 0 | 245 | |
| 1 | 190 | |
| 2 | 165 | |
| 3 | 95 | 7.5% |
| 7 | 71 | 5.6% |
| 6 | 63 | 5.0% |
| 4 | 62 | 4.9% |
| 8 | 57 | 4.5% |
| 5 | 54 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 257 | |
| W | 182 | |
| R | 182 | |
| G | 75 | 9.4% |
| U | 74 | 9.2% |
| C | 15 | 1.9% |
| L | 15 | 1.9% |
| D | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1534 | |
| Latin | 801 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 272 | ||
| 9 | 260 | |
| 0 | 245 | |
| 1 | 190 | |
| 2 | 165 | |
| 3 | 95 | 6.2% |
| 7 | 71 | 4.6% |
| 6 | 63 | 4.1% |
| 4 | 62 | 4.0% |
| 8 | 57 | 3.7% |
Latin
| Value | Count | Frequency (%) |
| M | 257 | |
| W | 182 | |
| R | 182 | |
| G | 75 | 9.4% |
| U | 74 | 9.2% |
| C | 15 | 1.9% |
| L | 15 | 1.9% |
| D | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2335 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 272 | ||
| 9 | 260 | |
| M | 257 | |
| 0 | 245 | |
| 1 | 190 | |
| W | 182 | |
| R | 182 | |
| 2 | 165 | |
| 3 | 95 | 4.1% |
| G | 75 | 3.2% |
| Other values (9) | 412 |
individualCount
Text
| Distinct | 158 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.004863086 |
| Min length | 1 |
Unique
| Unique | 51 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 576101 | |
| 2 | 1312 | 0.2% |
| 0 | 1007 | 0.2% |
| 3 | 830 | 0.1% |
| 5 | 523 | 0.1% |
| 4 | 522 | 0.1% |
| 6 | 386 | 0.1% |
| 7 | 339 | 0.1% |
| 8 | 271 | < 0.1% |
| 10 | 257 | < 0.1% |
| Other values (148) | 2649 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 577649 | |
| 2 | 2199 | 0.4% |
| 0 | 2065 | 0.4% |
| 3 | 1313 | 0.2% |
| 5 | 1043 | 0.2% |
| 4 | 852 | 0.1% |
| 6 | 611 | 0.1% |
| 7 | 518 | 0.1% |
| 8 | 428 | 0.1% |
| 9 | 360 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 587038 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 577649 | |
| 2 | 2199 | 0.4% |
| 0 | 2065 | 0.4% |
| 3 | 1313 | 0.2% |
| 5 | 1043 | 0.2% |
| 4 | 852 | 0.1% |
| 6 | 611 | 0.1% |
| 7 | 518 | 0.1% |
| 8 | 428 | 0.1% |
| 9 | 360 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 587038 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 577649 | |
| 2 | 2199 | 0.4% |
| 0 | 2065 | 0.4% |
| 3 | 1313 | 0.2% |
| 5 | 1043 | 0.2% |
| 4 | 852 | 0.1% |
| 6 | 611 | 0.1% |
| 7 | 518 | 0.1% |
| 8 | 428 | 0.1% |
| 9 | 360 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 587038 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 577649 | |
| 2 | 2199 | 0.4% |
| 0 | 2065 | 0.4% |
| 3 | 1313 | 0.2% |
| 5 | 1043 | 0.2% |
| 4 | 852 | 0.1% |
| 6 | 611 | 0.1% |
| 7 | 518 | 0.1% |
| 8 | 428 | 0.1% |
| 9 | 360 | 0.1% |
sex
Text
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 527948 |
| Missing (%) | 90.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 4 |
| Mean length | 5.299326258 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Male |
| 3rd row | Female |
| 4th row | Male |
| 5th row | Female |
| Value | Count | Frequency (%) |
| male | 29804 | |
| female | 22454 | |
| sex | 3994 | 6.6% |
| unknown | 3994 | 6.6% |
| 108 | 0.2% | |
| hermaphrodite | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 78708 | |
| a | 52259 | |
| l | 52258 | |
| M | 29759 | 10.0% |
| m | 22500 | 7.5% |
| F | 22437 | 7.5% |
| n | 11982 | 4.0% |
| 4102 | 1.4% | |
| o | 3995 | 1.3% |
| w | 3994 | 1.3% |
| Other values (12) | 16109 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 237703 | |
| Uppercase Letter | 56190 | 18.8% |
| Space Separator | 4102 | 1.4% |
| Other Punctuation | 108 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 78708 | |
| a | 52259 | |
| l | 52258 | |
| m | 22500 | 9.5% |
| n | 11982 | 5.0% |
| o | 3995 | 1.7% |
| w | 3994 | 1.7% |
| k | 3994 | 1.7% |
| u | 3994 | 1.7% |
| x | 3994 | 1.7% |
| Other values (7) | 25 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 29759 | |
| F | 22437 | |
| S | 3994 | 7.1% |
Space Separator
| Value | Count | Frequency (%) |
| 4102 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 108 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 293893 | |
| Common | 4210 | 1.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 78708 | |
| a | 52259 | |
| l | 52258 | |
| M | 29759 | 10.1% |
| m | 22500 | 7.7% |
| F | 22437 | 7.6% |
| n | 11982 | 4.1% |
| o | 3995 | 1.4% |
| w | 3994 | 1.4% |
| k | 3994 | 1.4% |
| Other values (10) | 12007 | 4.1% |
Common
| Value | Count | Frequency (%) |
| 4102 | ||
| ? | 108 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 298103 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 78708 | |
| a | 52259 | |
| l | 52258 | |
| M | 29759 | 10.0% |
| m | 22500 | 7.5% |
| F | 22437 | 7.5% |
| n | 11982 | 4.0% |
| 4102 | 1.4% | |
| o | 3995 | 1.3% |
| w | 3994 | 1.3% |
| Other values (12) | 16109 | 5.4% |
lifeStage
Text
Missing 
| Distinct | 240 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 539845 |
| Missing (%) | 92.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 43 |
| Mean length | 7.309247903 |
| Min length | 3 |
Unique
| Unique | 91 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Metamorph |
|---|---|
| 2nd row | Larva |
| 3rd row | Eggs |
| 4th row | Larva |
| 5th row | Metamorph |
| Value | Count | Frequency (%) |
| juvenile | 20305 | |
| larva | 6310 | 13.6% |
| larvae | 5451 | 11.8% |
| adult | 3825 | 8.3% |
| hatchling | 2086 | 4.5% |
| metamorph | 1344 | 2.9% |
| embryo | 905 | 2.0% |
| eggs | 812 | 1.8% |
| neonate | 578 | 1.2% |
| subadult | 530 | 1.1% |
| Other values (117) | 4162 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 51203 | |
| v | 32241 | |
| a | 30579 | |
| l | 27879 | |
| u | 25504 | |
| n | 24522 | |
| i | 23249 | 7.2% |
| J | 20353 | 6.3% |
| r | 16052 | 5.0% |
| L | 11780 | 3.6% |
| Other values (53) | 60847 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 277884 | |
| Uppercase Letter | 43941 | 13.6% |
| Space Separator | 1952 | 0.6% |
| Open Punctuation | 122 | < 0.1% |
| Close Punctuation | 122 | < 0.1% |
| Other Punctuation | 87 | < 0.1% |
| Decimal Number | 61 | < 0.1% |
| Dash Punctuation | 35 | < 0.1% |
| Math Symbol | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 51203 | |
| v | 32241 | |
| a | 30579 | |
| l | 27879 | |
| u | 25504 | |
| n | 24522 | |
| i | 23249 | |
| r | 16052 | 5.8% |
| t | 10794 | 3.9% |
| d | 5137 | 1.8% |
| Other values (14) | 30724 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 20353 | |
| L | 11780 | |
| A | 3619 | 8.2% |
| E | 2494 | 5.7% |
| H | 2456 | 5.6% |
| M | 1238 | 2.8% |
| N | 692 | 1.6% |
| S | 554 | 1.3% |
| P | 382 | 0.9% |
| R | 95 | 0.2% |
| Other values (10) | 278 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 26 | |
| 5 | 16 | |
| 1 | 6 | 9.8% |
| 3 | 4 | 6.6% |
| 4 | 4 | 6.6% |
| 6 | 2 | 3.3% |
| 8 | 1 | 1.6% |
| 9 | 1 | 1.6% |
| 0 | 1 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 31 | |
| ; | 28 | |
| / | 15 | |
| ? | 10 | 11.5% |
| . | 3 | 3.4% |
Space Separator
| Value | Count | Frequency (%) |
| 1952 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 122 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 122 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 35 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 321825 | |
| Common | 2384 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 51203 | |
| v | 32241 | |
| a | 30579 | |
| l | 27879 | |
| u | 25504 | |
| n | 24522 | |
| i | 23249 | 7.2% |
| J | 20353 | 6.3% |
| r | 16052 | 5.0% |
| L | 11780 | 3.7% |
| Other values (34) | 58463 |
Common
| Value | Count | Frequency (%) |
| 1952 | ||
| ( | 122 | 5.1% |
| ) | 122 | 5.1% |
| - | 35 | 1.5% |
| , | 31 | 1.3% |
| ; | 28 | 1.2% |
| 2 | 26 | 1.1% |
| 5 | 16 | 0.7% |
| / | 15 | 0.6% |
| ? | 10 | 0.4% |
| Other values (9) | 27 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 324209 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 51203 | |
| v | 32241 | |
| a | 30579 | |
| l | 27879 | |
| u | 25504 | |
| n | 24522 | |
| i | 23249 | 7.2% |
| J | 20353 | 6.3% |
| r | 16052 | 5.0% |
| L | 11780 | 3.6% |
| Other values (53) | 60847 |
preparations
Text
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5684 |
| Missing (%) | 1.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 7 |
| Mean length | 7.117061383 |
| Min length | 3 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Ethanol |
|---|---|
| 2nd row | Ethanol; Histological Material |
| 3rd row | Ethanol; Dry |
| 4th row | Ethanol |
| 5th row | Ethanol |
| Value | Count | Frequency (%) |
| ethanol | 553871 | |
| dry | 13058 | 2.2% |
| formalin | 8143 | 1.4% |
| cleared | 4474 | 0.8% |
| and | 4474 | 0.8% |
| stained | 4474 | 0.8% |
| histological | 2058 | 0.3% |
| material | 2058 | 0.3% |
| photograph | 126 | < 0.1% |
| sem | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 581736 | |
| l | 572662 | |
| n | 570962 | |
| o | 566382 | |
| t | 562587 | |
| h | 554123 | |
| E | 553874 | |
| r | 27859 | 0.7% |
| i | 18791 | 0.5% |
| e | 15480 | 0.4% |
| Other values (16) | 92885 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3511631 | |
| Uppercase Letter | 588271 | 14.3% |
| Space Separator | 14223 | 0.3% |
| Other Punctuation | 3216 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 581736 | |
| l | 572662 | |
| n | 570962 | |
| o | 566382 | |
| t | 562587 | |
| h | 554123 | |
| r | 27859 | 0.8% |
| i | 18791 | 0.5% |
| e | 15480 | 0.4% |
| d | 13422 | 0.4% |
| Other values (6) | 27627 | 0.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 553874 | |
| D | 13058 | 2.2% |
| F | 8143 | 1.4% |
| S | 4477 | 0.8% |
| C | 4474 | 0.8% |
| M | 2061 | 0.4% |
| H | 2058 | 0.3% |
| P | 126 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 14223 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 3216 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4099902 | |
| Common | 17439 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 581736 | |
| l | 572662 | |
| n | 570962 | |
| o | 566382 | |
| t | 562587 | |
| h | 554123 | |
| E | 553874 | |
| r | 27859 | 0.7% |
| i | 18791 | 0.5% |
| e | 15480 | 0.4% |
| Other values (14) | 75446 | 1.8% |
Common
| Value | Count | Frequency (%) |
| 14223 | ||
| ; | 3216 | 18.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4117341 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 581736 | |
| l | 572662 | |
| n | 570962 | |
| o | 566382 | |
| t | 562587 | |
| h | 554123 | |
| E | 553874 | |
| r | 27859 | 0.7% |
| i | 18791 | 0.5% |
| e | 15480 | 0.4% |
| Other values (16) | 92885 | 2.3% |
associatedMedia
Text
Missing 
| Distinct | 4962 |
|---|---|
| Distinct (%) | 96.4% |
| Missing | 579054 |
| Missing (%) | 99.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 299 |
|---|---|
| Median length | 279 |
| Mean length | 68.65785895 |
| Min length | 48 |
Unique
| Unique | 4905 ? |
|---|---|
| Unique (%) | 95.3% |
Sample
| 1st row | https://collections.nmnh.si.edu/media/?i=14894414; 14895830; 14895831; 14895832; 14895833 |
|---|---|
| 2nd row | https://collections.nmnh.si.edu/media/?i=14589063; 14589068 |
| 3rd row | https://collections.nmnh.si.edu/media/?i=14894289; 14894859; 14894860 |
| 4th row | https://collections.nmnh.si.edu/media/?i=6000993; 6000994; 6000992 |
| 5th row | https://collections.nmnh.si.edu/media/?i=16155167; 16155168; 16155169; 16155170 |
| Value | Count | Frequency (%) |
| https://collections.nmnh.si.edu/media/?i=14580337 | 28 | 0.2% |
| 10295705 | 27 | 0.2% |
| https://collections.nmnh.si.edu/media/?i=10389334 | 27 | 0.2% |
| 10169077 | 19 | 0.1% |
| 10153185 | 18 | 0.1% |
| https://collections.nmnh.si.edu/media/?i=16688871 | 13 | 0.1% |
| 6001652 | 12 | 0.1% |
| https://collections.nmnh.si.edu/media/?i=10690530 | 11 | 0.1% |
| 10690531 | 11 | 0.1% |
| 10295177 | 10 | 0.1% |
| Other values (14831) | 15298 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 22331 | 6.3% |
| / | 20588 | 5.8% |
| i | 20588 | 5.8% |
| 0 | 17642 | 5.0% |
| t | 15441 | 4.4% |
| s | 15441 | 4.4% |
| e | 15441 | 4.4% |
| n | 15441 | 4.4% |
| . | 15441 | 4.4% |
| 2 | 14443 | 4.1% |
| Other values (21) | 180585 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 159557 | |
| Decimal Number | 121701 | |
| Other Punctuation | 56650 | 16.0% |
| Space Separator | 10327 | 2.9% |
| Math Symbol | 5147 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 20588 | |
| t | 15441 | |
| s | 15441 | |
| e | 15441 | |
| n | 15441 | |
| h | 10294 | 6.5% |
| d | 10294 | 6.5% |
| m | 10294 | 6.5% |
| l | 10294 | 6.5% |
| o | 10294 | 6.5% |
| Other values (4) | 25735 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 22331 | |
| 0 | 17642 | |
| 2 | 14443 | |
| 6 | 12036 | |
| 4 | 11244 | |
| 8 | 9476 | |
| 9 | 9312 | |
| 3 | 9025 | |
| 5 | 8430 | 6.9% |
| 7 | 7762 | 6.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 20588 | |
| . | 15441 | |
| ; | 10327 | |
| ? | 5147 | 9.1% |
| : | 5147 | 9.1% |
Space Separator
| Value | Count | Frequency (%) |
| 10327 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 5147 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 193825 | |
| Latin | 159557 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 22331 | |
| / | 20588 | |
| 0 | 17642 | 9.1% |
| . | 15441 | 8.0% |
| 2 | 14443 | 7.5% |
| 6 | 12036 | 6.2% |
| 4 | 11244 | 5.8% |
| 10327 | 5.3% | |
| ; | 10327 | 5.3% |
| 8 | 9476 | 4.9% |
| Other values (7) | 49970 |
Latin
| Value | Count | Frequency (%) |
| i | 20588 | |
| t | 15441 | |
| s | 15441 | |
| e | 15441 | |
| n | 15441 | |
| h | 10294 | 6.5% |
| d | 10294 | 6.5% |
| m | 10294 | 6.5% |
| l | 10294 | 6.5% |
| o | 10294 | 6.5% |
| Other values (4) | 25735 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 353382 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 22331 | 6.3% |
| / | 20588 | 5.8% |
| i | 20588 | 5.8% |
| 0 | 17642 | 5.0% |
| t | 15441 | 4.4% |
| s | 15441 | 4.4% |
| e | 15441 | 4.4% |
| n | 15441 | 4.4% |
| . | 15441 | 4.4% |
| 2 | 14443 | 4.1% |
| Other values (21) | 180585 |
Missing 
| Distinct | 719 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 583480 |
| Missing (%) | 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 699 |
|---|---|
| Median length | 99 |
| Mean length | 112.1983356 |
| Min length | 49 |
Unique
| Unique | 717 ? |
|---|---|
| Unique (%) | 99.4% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=AF199141|https://www.ncbi.nlm.nih.gov/gquery?term=AF199204 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=OM928184|https://www.ncbi.nlm.nih.gov/gquery?term=OM943246 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=JQ914700 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=FJ613461 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=FJ766602|https://www.ncbi.nlm.nih.gov/gquery?term=FJ784443 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn112709|https://www.ncbi.nlm.nih.gov/gquery?term=jn112771|https://www.ncbi.nlm.nih.gov/gquery?term=jn112642 | 2 | 0.3% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay604497 | 2 | 0.3% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj976636 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn377389|https://www.ncbi.nlm.nih.gov/gquery?term=jn377393|https://www.ncbi.nlm.nih.gov/gquery?term=jn377405 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kc129216|https://www.ncbi.nlm.nih.gov/gquery?term=kc129324 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay604512 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj766829|https://www.ncbi.nlm.nih.gov/gquery?term=fj784465 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=om928184|https://www.ncbi.nlm.nih.gov/gquery?term=om943246 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jq914700 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj613461 | 1 | 0.1% |
| Other values (709) | 709 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 6533 | 8.1% |
| t | 4896 | 6.1% |
| / | 4896 | 6.1% |
| w | 4896 | 6.1% |
| n | 4896 | 6.1% |
| h | 3264 | 4.0% |
| r | 3264 | 4.0% |
| i | 3264 | 4.0% |
| e | 3264 | 4.0% |
| m | 3264 | 4.0% |
| Other values (45) | 38458 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 50592 | |
| Other Punctuation | 14693 | 18.2% |
| Decimal Number | 9801 | 12.1% |
| Uppercase Letter | 3266 | 4.0% |
| Math Symbol | 2543 | 3.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 653 | |
| F | 619 | |
| M | 450 | |
| K | 434 | |
| A | 177 | 5.4% |
| Y | 150 | 4.6% |
| Q | 129 | 3.9% |
| H | 104 | 3.2% |
| N | 86 | 2.6% |
| O | 76 | 2.3% |
| Other values (10) | 388 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4896 | 9.7% |
| w | 4896 | 9.7% |
| n | 4896 | 9.7% |
| h | 3264 | 6.5% |
| r | 3264 | 6.5% |
| i | 3264 | 6.5% |
| e | 3264 | 6.5% |
| m | 3264 | 6.5% |
| g | 3264 | 6.5% |
| q | 1632 | 3.2% |
| Other values (9) | 14688 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1506 | |
| 6 | 1267 | |
| 7 | 1220 | |
| 8 | 1087 | |
| 3 | 960 | |
| 1 | 840 | |
| 5 | 786 | |
| 2 | 763 | |
| 9 | 763 | |
| 0 | 609 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6533 | |
| / | 4896 | |
| ? | 1632 | 11.1% |
| : | 1632 | 11.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1632 | |
| | | 911 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 53858 | |
| Common | 27037 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4896 | 9.1% |
| w | 4896 | 9.1% |
| n | 4896 | 9.1% |
| h | 3264 | 6.1% |
| r | 3264 | 6.1% |
| i | 3264 | 6.1% |
| e | 3264 | 6.1% |
| m | 3264 | 6.1% |
| g | 3264 | 6.1% |
| q | 1632 | 3.0% |
| Other values (29) | 17954 |
Common
| Value | Count | Frequency (%) |
| . | 6533 | |
| / | 4896 | |
| = | 1632 | 6.0% |
| ? | 1632 | 6.0% |
| : | 1632 | 6.0% |
| 4 | 1506 | 5.6% |
| 6 | 1267 | 4.7% |
| 7 | 1220 | 4.5% |
| 8 | 1087 | 4.0% |
| 3 | 960 | 3.6% |
| Other values (6) | 4672 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 80895 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 6533 | 8.1% |
| t | 4896 | 6.1% |
| / | 4896 | 6.1% |
| w | 4896 | 6.1% |
| n | 4896 | 6.1% |
| h | 3264 | 4.0% |
| r | 3264 | 4.0% |
| i | 3264 | 4.0% |
| e | 3264 | 4.0% |
| m | 3264 | 4.0% |
| Other values (45) | 38458 |
Missing 
| Distinct | 5339 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 557618 |
| Missing (%) | 95.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 1294 |
|---|---|
| Median length | 381 |
| Mean length | 66.70947598 |
| Min length | 3 |
Unique
| Unique | 3351 ? |
|---|---|
| Unique (%) | 12.6% |
Sample
| 1st row | Collected from vegetation removal plot (Cocolob 2) in coastal strand Cocolobo uvifera forest, ca. 10 m inland from beach. |
|---|---|
| 2nd row | Collected in roadside ditch in gum/bay swamp. Water depth: 10-40 cm. |
| 3rd row | Complete clutch of eggs removed from the ovaries of a female (Total Length: 57 inches) collected along wooded road. |
| 4th row | Collected on surface at night. |
| 5th row | Collected above and below the falls, south of the creek. |
| Value | Count | Frequency (%) |
| collected | 21028 | 7.1% |
| in | 15429 | 5.2% |
| of | 11658 | 3.9% |
| the | 11088 | 3.7% |
| on | 10611 | 3.6% |
| from | 7596 | 2.6% |
| and | 5597 | 1.9% |
| at | 5284 | 1.8% |
| area | 4127 | 1.4% |
| road | 4049 | 1.4% |
| Other values (6088) | 200792 |
Most occurring characters
| Value | Count | Frequency (%) |
| 270676 | ||
| e | 160711 | 9.1% |
| o | 140496 | 7.9% |
| a | 114427 | 6.5% |
| t | 108900 | 6.1% |
| l | 98347 | 5.5% |
| n | 89158 | 5.0% |
| r | 81415 | 4.6% |
| d | 76949 | 4.3% |
| i | 72320 | 4.1% |
| Other values (81) | 559939 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1313680 | |
| Space Separator | 270676 | 15.3% |
| Uppercase Letter | 64926 | 3.7% |
| Decimal Number | 55444 | 3.1% |
| Other Punctuation | 51683 | 2.9% |
| Open Punctuation | 5630 | 0.3% |
| Close Punctuation | 5620 | 0.3% |
| Dash Punctuation | 5566 | 0.3% |
| Math Symbol | 113 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 160711 | |
| o | 140496 | |
| a | 114427 | 8.7% |
| t | 108900 | 8.3% |
| l | 98347 | 7.5% |
| n | 89158 | 6.8% |
| r | 81415 | 6.2% |
| d | 76949 | 5.9% |
| i | 72320 | 5.5% |
| s | 64649 | 4.9% |
| Other values (23) | 306308 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 24437 | |
| P | 4884 | 7.5% |
| N | 3946 | 6.1% |
| A | 3756 | 5.8% |
| S | 3750 | 5.8% |
| T | 2957 | 4.6% |
| R | 2957 | 4.6% |
| M | 2263 | 3.5% |
| F | 1854 | 2.9% |
| H | 1826 | 2.8% |
| Other values (16) | 12296 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 35953 | |
| , | 7909 | 15.3% |
| : | 3238 | 6.3% |
| " | 1687 | 3.3% |
| ; | 1332 | 2.6% |
| ' | 566 | 1.1% |
| / | 486 | 0.9% |
| % | 225 | 0.4% |
| # | 199 | 0.4% |
| ? | 57 | 0.1% |
| Other values (2) | 31 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 12010 | |
| 0 | 9775 | |
| 2 | 7516 | |
| 9 | 5201 | |
| 8 | 4032 | 7.3% |
| 5 | 3818 | 6.9% |
| 3 | 3764 | 6.8% |
| 7 | 3390 | 6.1% |
| 6 | 3356 | 6.1% |
| 4 | 2582 | 4.7% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 100 | |
| + | 7 | 6.2% |
| < | 4 | 3.5% |
| > | 2 | 1.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5543 | |
| [ | 87 | 1.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5533 | |
| ] | 87 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 270676 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5566 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1378606 | |
| Common | 394732 | 22.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 160711 | |
| o | 140496 | 10.2% |
| a | 114427 | 8.3% |
| t | 108900 | 7.9% |
| l | 98347 | 7.1% |
| n | 89158 | 6.5% |
| r | 81415 | 5.9% |
| d | 76949 | 5.6% |
| i | 72320 | 5.2% |
| s | 64649 | 4.7% |
| Other values (49) | 371234 |
Common
| Value | Count | Frequency (%) |
| 270676 | ||
| . | 35953 | 9.1% |
| 1 | 12010 | 3.0% |
| 0 | 9775 | 2.5% |
| , | 7909 | 2.0% |
| 2 | 7516 | 1.9% |
| - | 5566 | 1.4% |
| ( | 5543 | 1.4% |
| ) | 5533 | 1.4% |
| 9 | 5201 | 1.3% |
| Other values (22) | 29050 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1773305 | |
| None | 33 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 270676 | ||
| e | 160711 | 9.1% |
| o | 140496 | 7.9% |
| a | 114427 | 6.5% |
| t | 108900 | 6.1% |
| l | 98347 | 5.5% |
| n | 89158 | 5.0% |
| r | 81415 | 4.6% |
| d | 76949 | 4.3% |
| i | 72320 | 4.1% |
| Other values (74) | 559906 |
None
| Value | Count | Frequency (%) |
| ö | 14 | |
| á | 7 | |
| é | 5 | 15.2% |
| ó | 2 | 6.1% |
| ü | 2 | 6.1% |
| è | 2 | 6.1% |
| ñ | 1 | 3.0% |
fieldNumber
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 584193 |
| Missing (%) | > 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.125 |
| Min length | 6 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 12.5% |
Sample
| 1st row | 83-012 |
|---|---|
| 2nd row | 83-012 |
| 3rd row | 83-012 |
| 4th row | 83-012 |
| 5th row | 83-012 |
| Value | Count | Frequency (%) |
| 83-012 | 7 | |
| 83-024a | 1 | 12.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 8 | |
| 3 | 8 | |
| - | 8 | |
| 0 | 8 | |
| 2 | 8 | |
| 1 | 7 | |
| 4 | 1 | 2.0% |
| A | 1 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40 | |
| Dash Punctuation | 8 | 16.3% |
| Uppercase Letter | 1 | 2.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 8 | |
| 3 | 8 | |
| 0 | 8 | |
| 2 | 8 | |
| 1 | 7 | |
| 4 | 1 | 2.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 48 | |
| Latin | 1 | 2.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 8 | |
| 3 | 8 | |
| - | 8 | |
| 0 | 8 | |
| 2 | 8 | |
| 1 | 7 | |
| 4 | 1 | 2.1% |
Latin
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 8 | |
| 3 | 8 | |
| - | 8 | |
| 0 | 8 | |
| 2 | 8 | |
| 1 | 7 | |
| 4 | 1 | 2.0% |
| A | 1 | 2.0% |
eventDate
Text
Missing 
| Distinct | 31354 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 37781 |
| Missing (%) | 6.5% |
| Memory size | 4.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 10 |
| Mean length | 9.988499689 |
| Min length | 4 |
Unique
| Unique | 7237 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | 1972-02-01/1972-02-03 |
|---|---|
| 2nd row | 1971-09-03 |
| 3rd row | 1992-10-15 |
| 4th row | 1992-06-24 |
| 5th row | 1998-09-03 |
| Value | Count | Frequency (%) |
| 1973-09-22 | 723 | 0.1% |
| 1883 | 697 | 0.1% |
| 1998-10-09 | 690 | 0.1% |
| 1935 | 684 | 0.1% |
| 1971-08-16 | 610 | 0.1% |
| 1966-04-11 | 579 | 0.1% |
| 1970-06-19 | 564 | 0.1% |
| 1976-10-03 | 540 | 0.1% |
| 1971-07-31 | 521 | 0.1% |
| 1969-06-27 | 472 | 0.1% |
| Other values (31319) | 540908 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 1060453 | |
| 1 | 993873 | |
| 0 | 815551 | |
| 9 | 736835 | |
| 2 | 357571 | 6.6% |
| 7 | 295686 | 5.4% |
| 6 | 289115 | 5.3% |
| 8 | 288256 | 5.3% |
| 3 | 212557 | 3.9% |
| 5 | 209699 | 3.8% |
| Other values (6) | 198320 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4378606 | |
| Dash Punctuation | 1060453 | 19.4% |
| Other Punctuation | 17721 | 0.3% |
| Space Separator | 568 | < 0.1% |
| Lowercase Letter | 568 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 993873 | |
| 0 | 815551 | |
| 9 | 736835 | |
| 2 | 357571 | 8.2% |
| 7 | 295686 | 6.8% |
| 6 | 289115 | 6.6% |
| 8 | 288256 | 6.6% |
| 3 | 212557 | 4.9% |
| 5 | 209699 | 4.8% |
| 4 | 179463 | 4.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 17679 | |
| , | 42 | 0.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 284 | |
| r | 284 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1060453 |
Space Separator
| Value | Count | Frequency (%) |
| 568 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5457348 | |
| Latin | 568 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 1060453 | |
| 1 | 993873 | |
| 0 | 815551 | |
| 9 | 736835 | |
| 2 | 357571 | 6.6% |
| 7 | 295686 | 5.4% |
| 6 | 289115 | 5.3% |
| 8 | 288256 | 5.3% |
| 3 | 212557 | 3.9% |
| 5 | 209699 | 3.8% |
| Other values (4) | 197752 | 3.6% |
Latin
| Value | Count | Frequency (%) |
| o | 284 | |
| r | 284 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5457916 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 1060453 | |
| 1 | 993873 | |
| 0 | 815551 | |
| 9 | 736835 | |
| 2 | 357571 | 6.6% |
| 7 | 295686 | 5.4% |
| 6 | 289115 | 5.3% |
| 8 | 288256 | 5.3% |
| 3 | 212557 | 3.9% |
| 5 | 209699 | 3.8% |
| Other values (6) | 198320 | 3.6% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 55728 |
| Missing (%) | 9.5% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.785824441 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 32 |
|---|---|
| 2nd row | 246 |
| 3rd row | 289 |
| 4th row | 176 |
| 5th row | 246 |
| Value | Count | Frequency (%) |
| 151 | 5188 | 1.0% |
| 212 | 5134 | 1.0% |
| 243 | 4767 | 0.9% |
| 181 | 4630 | 0.9% |
| 120 | 3680 | 0.7% |
| 91 | 3127 | 0.6% |
| 90 | 2993 | 0.6% |
| 227 | 2917 | 0.6% |
| 152 | 2853 | 0.5% |
| 230 | 2852 | 0.5% |
| Other values (356) | 490332 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 312432 | |
| 2 | 283563 | |
| 3 | 159914 | |
| 4 | 107096 | 7.3% |
| 0 | 106264 | 7.2% |
| 5 | 103396 | 7.0% |
| 8 | 101181 | 6.9% |
| 9 | 100780 | 6.8% |
| 6 | 99795 | 6.8% |
| 7 | 97812 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1472233 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 312432 | |
| 2 | 283563 | |
| 3 | 159914 | |
| 4 | 107096 | 7.3% |
| 0 | 106264 | 7.2% |
| 5 | 103396 | 7.0% |
| 8 | 101181 | 6.9% |
| 9 | 100780 | 6.8% |
| 6 | 99795 | 6.8% |
| 7 | 97812 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1472233 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 312432 | |
| 2 | 283563 | |
| 3 | 159914 | |
| 4 | 107096 | 7.3% |
| 0 | 106264 | 7.2% |
| 5 | 103396 | 7.0% |
| 8 | 101181 | 6.9% |
| 9 | 100780 | 6.8% |
| 6 | 99795 | 6.8% |
| 7 | 97812 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1472233 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 312432 | |
| 2 | 283563 | |
| 3 | 159914 | |
| 4 | 107096 | 7.3% |
| 0 | 106264 | 7.2% |
| 5 | 103396 | 7.0% |
| 8 | 101181 | 6.9% |
| 9 | 100780 | 6.8% |
| 6 | 99795 | 6.8% |
| 7 | 97812 | 6.6% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 55637 |
| Missing (%) | 9.5% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.786563217 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 34 |
|---|---|
| 2nd row | 246 |
| 3rd row | 289 |
| 4th row | 176 |
| 5th row | 246 |
| Value | Count | Frequency (%) |
| 151 | 5257 | 1.0% |
| 212 | 5068 | 1.0% |
| 243 | 4867 | 0.9% |
| 181 | 4612 | 0.9% |
| 120 | 3515 | 0.7% |
| 91 | 3266 | 0.6% |
| 230 | 3042 | 0.6% |
| 227 | 2924 | 0.6% |
| 59 | 2923 | 0.6% |
| 90 | 2917 | 0.6% |
| Other values (356) | 490173 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 313155 | |
| 2 | 283562 | |
| 3 | 160654 | |
| 4 | 106856 | 7.3% |
| 0 | 106039 | 7.2% |
| 5 | 103988 | 7.1% |
| 8 | 101330 | 6.9% |
| 9 | 101260 | 6.9% |
| 6 | 98792 | 6.7% |
| 7 | 97241 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1472877 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 313155 | |
| 2 | 283562 | |
| 3 | 160654 | |
| 4 | 106856 | 7.3% |
| 0 | 106039 | 7.2% |
| 5 | 103988 | 7.1% |
| 8 | 101330 | 6.9% |
| 9 | 101260 | 6.9% |
| 6 | 98792 | 6.7% |
| 7 | 97241 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1472877 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 313155 | |
| 2 | 283562 | |
| 3 | 160654 | |
| 4 | 106856 | 7.3% |
| 0 | 106039 | 7.2% |
| 5 | 103988 | 7.1% |
| 8 | 101330 | 6.9% |
| 9 | 101260 | 6.9% |
| 6 | 98792 | 6.7% |
| 7 | 97241 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1472877 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 313155 | |
| 2 | 283562 | |
| 3 | 160654 | |
| 4 | 106856 | 7.3% |
| 0 | 106039 | 7.2% |
| 5 | 103988 | 7.1% |
| 8 | 101330 | 6.9% |
| 9 | 101260 | 6.9% |
| 6 | 98792 | 6.7% |
| 7 | 97241 | 6.6% |
year
Text
Missing 
| Distinct | 184 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 37781 |
| Missing (%) | 6.5% |
| Memory size | 4.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1972 |
|---|---|
| 2nd row | 1971 |
| 3rd row | 1992 |
| 4th row | 1992 |
| 5th row | 1998 |
| Value | Count | Frequency (%) |
| 1971 | 17001 | 3.1% |
| 1966 | 15984 | 2.9% |
| 1969 | 15783 | 2.9% |
| 1970 | 15631 | 2.9% |
| 1976 | 15293 | 2.8% |
| 1980 | 15182 | 2.8% |
| 1979 | 14987 | 2.7% |
| 1972 | 14413 | 2.6% |
| 1961 | 12799 | 2.3% |
| 1984 | 12649 | 2.3% |
| Other values (174) | 396698 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 629501 | |
| 1 | 601542 | |
| 7 | 176597 | 8.1% |
| 6 | 174519 | 8.0% |
| 8 | 162711 | 7.4% |
| 0 | 112997 | 5.2% |
| 2 | 90398 | 4.1% |
| 5 | 85543 | 3.9% |
| 3 | 82534 | 3.8% |
| 4 | 69338 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2185680 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 629501 | |
| 1 | 601542 | |
| 7 | 176597 | 8.1% |
| 6 | 174519 | 8.0% |
| 8 | 162711 | 7.4% |
| 0 | 112997 | 5.2% |
| 2 | 90398 | 4.1% |
| 5 | 85543 | 3.9% |
| 3 | 82534 | 3.8% |
| 4 | 69338 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2185680 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 629501 | |
| 1 | 601542 | |
| 7 | 176597 | 8.1% |
| 6 | 174519 | 8.0% |
| 8 | 162711 | 7.4% |
| 0 | 112997 | 5.2% |
| 2 | 90398 | 4.1% |
| 5 | 85543 | 3.9% |
| 3 | 82534 | 3.8% |
| 4 | 69338 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2185680 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 629501 | |
| 1 | 601542 | |
| 7 | 176597 | 8.1% |
| 6 | 174519 | 8.0% |
| 8 | 162711 | 7.4% |
| 0 | 112997 | 5.2% |
| 2 | 90398 | 4.1% |
| 5 | 85543 | 3.9% |
| 3 | 82534 | 3.8% |
| 4 | 69338 | 3.2% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 54300 |
| Missing (%) | 9.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.163641888 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 9 |
| 3rd row | 10 |
| 4th row | 6 |
| 5th row | 9 |
| Value | Count | Frequency (%) |
| 8 | 67636 | |
| 7 | 64391 | |
| 5 | 64348 | |
| 6 | 59630 | |
| 4 | 55731 | |
| 3 | 47041 | |
| 10 | 43052 | |
| 9 | 36733 | |
| 11 | 25771 | 4.9% |
| 2 | 25643 | 4.8% |
| Other values (2) | 39925 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 134519 | |
| 8 | 67636 | |
| 7 | 64391 | |
| 5 | 64348 | |
| 6 | 59630 | |
| 4 | 55731 | |
| 3 | 47041 | 7.6% |
| 2 | 43534 | 7.1% |
| 0 | 43052 | 7.0% |
| 9 | 36733 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 616615 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 134519 | |
| 8 | 67636 | |
| 7 | 64391 | |
| 5 | 64348 | |
| 6 | 59630 | |
| 4 | 55731 | |
| 3 | 47041 | 7.6% |
| 2 | 43534 | 7.1% |
| 0 | 43052 | 7.0% |
| 9 | 36733 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 616615 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 134519 | |
| 8 | 67636 | |
| 7 | 64391 | |
| 5 | 64348 | |
| 6 | 59630 | |
| 4 | 55731 | |
| 3 | 47041 | 7.6% |
| 2 | 43534 | 7.1% |
| 0 | 43052 | 7.0% |
| 9 | 36733 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 616615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 134519 | |
| 8 | 67636 | |
| 7 | 64391 | |
| 5 | 64348 | |
| 6 | 59630 | |
| 4 | 55731 | |
| 3 | 47041 | 7.6% |
| 2 | 43534 | 7.1% |
| 0 | 43052 | 7.0% |
| 9 | 36733 | 6.0% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 85891 |
| Missing (%) | 14.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.714494993 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 3 |
| 3rd row | 15 |
| 4th row | 24 |
| 5th row | 3 |
| Value | Count | Frequency (%) |
| 15 | 19564 | 3.9% |
| 13 | 17837 | 3.6% |
| 19 | 17383 | 3.5% |
| 21 | 17361 | 3.5% |
| 25 | 17217 | 3.5% |
| 24 | 17005 | 3.4% |
| 3 | 17001 | 3.4% |
| 16 | 16842 | 3.4% |
| 20 | 16734 | 3.4% |
| 28 | 16674 | 3.3% |
| Other values (21) | 324692 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 227367 | |
| 2 | 211630 | |
| 3 | 75423 | 8.8% |
| 5 | 52999 | 6.2% |
| 8 | 48666 | 5.7% |
| 9 | 48337 | 5.7% |
| 0 | 48091 | 5.6% |
| 6 | 47877 | 5.6% |
| 4 | 47391 | 5.5% |
| 7 | 46569 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 854350 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 227367 | |
| 2 | 211630 | |
| 3 | 75423 | 8.8% |
| 5 | 52999 | 6.2% |
| 8 | 48666 | 5.7% |
| 9 | 48337 | 5.7% |
| 0 | 48091 | 5.6% |
| 6 | 47877 | 5.6% |
| 4 | 47391 | 5.5% |
| 7 | 46569 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 854350 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 227367 | |
| 2 | 211630 | |
| 3 | 75423 | 8.8% |
| 5 | 52999 | 6.2% |
| 8 | 48666 | 5.7% |
| 9 | 48337 | 5.7% |
| 0 | 48091 | 5.6% |
| 6 | 47877 | 5.6% |
| 4 | 47391 | 5.5% |
| 7 | 46569 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 854350 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 227367 | |
| 2 | 211630 | |
| 3 | 75423 | 8.8% |
| 5 | 52999 | 6.2% |
| 8 | 48666 | 5.7% |
| 9 | 48337 | 5.7% |
| 0 | 48091 | 5.6% |
| 6 | 47877 | 5.6% |
| 4 | 47391 | 5.5% |
| 7 | 46569 | 5.5% |
| Distinct | 42558 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 51 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 194 |
|---|---|
| Median length | 11 |
| Mean length | 12.14387743 |
| Min length | 4 |
Unique
| Unique | 14192 ? |
|---|---|
| Unique (%) | 2.4% |
Sample
| 1st row | 01-03 February 1972 |
|---|---|
| 2nd row | 3 Sep 1971 |
| 3rd row | -- --- ---- |
| 4th row | 15 Oct 1992; 09:05-13:00 hrs |
| 5th row | 24 Jun 1992; 10:30-11:40 hrs |
| Value | Count | Frequency (%) |
| 173374 | 9.4% | |
| may | 65316 | 3.5% |
| aug | 63760 | 3.5% |
| jul | 58386 | 3.2% |
| jun | 53770 | 2.9% |
| apr | 50984 | 2.8% |
| mar | 43098 | 2.3% |
| oct | 40349 | 2.2% |
| sep | 34295 | 1.9% |
| hrs | 24306 | 1.3% |
| Other values (3264) | 1238022 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1261510 | ||
| 1 | 874532 | 12.3% |
| 9 | 688315 | 9.7% |
| - | 499756 | 7.0% |
| 2 | 328876 | 4.6% |
| 0 | 243409 | 3.4% |
| 6 | 227222 | 3.2% |
| 7 | 227024 | 3.2% |
| 8 | 217953 | 3.1% |
| u | 208644 | 2.9% |
| Other values (64) | 2316605 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3263423 | |
| Lowercase Letter | 1431190 | |
| Space Separator | 1261510 | 17.8% |
| Uppercase Letter | 543907 | 7.7% |
| Dash Punctuation | 499756 | 7.0% |
| Other Punctuation | 92897 | 1.3% |
| Open Punctuation | 581 | < 0.1% |
| Close Punctuation | 581 | < 0.1% |
| Format | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 208644 | |
| r | 158708 | |
| a | 157043 | |
| e | 120723 | |
| n | 97727 | 6.8% |
| p | 94749 | 6.6% |
| y | 81426 | 5.7% |
| l | 78861 | 5.5% |
| g | 78230 | 5.5% |
| c | 71344 | 5.0% |
| Other values (16) | 283735 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 147803 | |
| A | 124847 | |
| M | 113882 | |
| O | 43248 | 8.0% |
| S | 39048 | 7.2% |
| F | 26562 | 4.9% |
| N | 26070 | 4.8% |
| D | 18361 | 3.4% |
| C | 3404 | 0.6% |
| E | 144 | < 0.1% |
| Other values (13) | 538 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 874532 | |
| 9 | 688315 | |
| 2 | 328876 | 10.1% |
| 0 | 243409 | 7.5% |
| 6 | 227222 | 7.0% |
| 7 | 227024 | 7.0% |
| 8 | 217953 | 6.7% |
| 3 | 176664 | 5.4% |
| 5 | 152254 | 4.7% |
| 4 | 127174 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 41924 | |
| ; | 34825 | |
| . | 14987 | 16.1% |
| , | 770 | 0.8% |
| / | 307 | 0.3% |
| ' | 46 | < 0.1% |
| " | 20 | < 0.1% |
| ? | 18 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 580 | |
| [ | 1 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 580 | |
| ] | 1 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1261510 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 499756 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5118749 | |
| Latin | 1975097 | 27.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 208644 | 10.6% |
| r | 158708 | 8.0% |
| a | 157043 | 8.0% |
| J | 147803 | 7.5% |
| A | 124847 | 6.3% |
| e | 120723 | 6.1% |
| M | 113882 | 5.8% |
| n | 97727 | 4.9% |
| p | 94749 | 4.8% |
| y | 81426 | 4.1% |
| Other values (39) | 669545 |
Common
| Value | Count | Frequency (%) |
| 1261510 | ||
| 1 | 874532 | |
| 9 | 688315 | |
| - | 499756 | 9.8% |
| 2 | 328876 | 6.4% |
| 0 | 243409 | 4.8% |
| 6 | 227222 | 4.4% |
| 7 | 227024 | 4.4% |
| 8 | 217953 | 4.3% |
| 3 | 176664 | 3.5% |
| Other values (15) | 373488 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7093845 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1261510 | ||
| 1 | 874532 | 12.3% |
| 9 | 688315 | 9.7% |
| - | 499756 | 7.0% |
| 2 | 328876 | 4.6% |
| 0 | 243409 | 3.4% |
| 6 | 227222 | 3.2% |
| 7 | 227024 | 3.2% |
| 8 | 217953 | 3.1% |
| u | 208644 | 2.9% |
| Other values (63) | 2316604 |
None
| Value | Count | Frequency (%) |
| | 1 |
higherGeography
Text
| Distinct | 6286 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 4414 |
| Missing (%) | 0.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 167 |
|---|---|
| Median length | 118 |
| Mean length | 48.81643259 |
| Min length | 4 |
Unique
| Unique | 1092 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Oceania, Papua New Guinea, Central Province, Kairuku-Hiri District, New Guinea |
|---|---|
| 2nd row | North America, United States, North Carolina, Buncombe - Yancey |
| 3rd row | Oceania, Pacific Ocean , Tonga, Tonga Islands, Tongatapu Island Group, Tonga Islands |
| 4th row | North America, Grenada, St. George Parish, Lesser Antilles, Windward Islands, Grenada Island |
| 5th row | North America, United States, Virginia, Augusta |
| Value | Count | Frequency (%) |
| america | 483266 | 12.9% |
| north | 476209 | 12.7% |
| states | 351020 | 9.4% |
| united | 349359 | 9.4% |
| virginia | 96173 | 2.6% |
| south | 71896 | 1.9% |
| islands | 71471 | 1.9% |
| carolina | 61728 | 1.7% |
| 54664 | 1.5% | |
| asia | 39306 | 1.1% |
| Other values (4622) | 1680221 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3155526 | 11.1% | |
| a | 2668328 | 9.4% |
| i | 2176293 | 7.7% |
| e | 2119779 | 7.5% |
| t | 1973350 | 7.0% |
| r | 1844062 | 6.5% |
| , | 1669519 | 5.9% |
| n | 1511861 | 5.3% |
| o | 1298349 | 4.6% |
| s | 1011828 | 3.6% |
| Other values (73) | 8874238 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19740985 | |
| Uppercase Letter | 3656252 | 12.9% |
| Space Separator | 3155526 | 11.1% |
| Other Punctuation | 1685470 | 6.0% |
| Dash Punctuation | 42316 | 0.1% |
| Open Punctuation | 11057 | < 0.1% |
| Close Punctuation | 11052 | < 0.1% |
| Math Symbol | 409 | < 0.1% |
| Decimal Number | 64 | < 0.1% |
| Modifier Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2668328 | |
| i | 2176293 | |
| e | 2119779 | |
| t | 1973350 | |
| r | 1844062 | |
| n | 1511861 | |
| o | 1298349 | 6.6% |
| s | 1011828 | 5.1% |
| c | 896985 | 4.5% |
| h | 740667 | 3.8% |
| Other values (28) | 3499483 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 644681 | |
| N | 528051 | |
| S | 527098 | |
| U | 359922 | |
| P | 226035 | 6.2% |
| C | 185016 | 5.1% |
| M | 170358 | 4.7% |
| I | 135691 | 3.7% |
| V | 116151 | 3.2% |
| G | 115778 | 3.2% |
| Other values (18) | 647471 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1669519 | |
| . | 13671 | 0.8% |
| ' | 2228 | 0.1% |
| ? | 41 | < 0.1% |
| / | 11 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 42077 | |
| – | 239 | 0.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10381 | |
| [ | 676 | 6.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10376 | |
| ] | 676 | 6.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 389 | |
| + | 20 | 4.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 32 | |
| 0 | 32 |
Space Separator
| Value | Count | Frequency (%) |
| 3155526 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23397237 | |
| Common | 4905896 | 17.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2668328 | 11.4% |
| i | 2176293 | 9.3% |
| e | 2119779 | 9.1% |
| t | 1973350 | 8.4% |
| r | 1844062 | 7.9% |
| n | 1511861 | 6.5% |
| o | 1298349 | 5.5% |
| s | 1011828 | 4.3% |
| c | 896985 | 3.8% |
| h | 740667 | 3.2% |
| Other values (56) | 7155735 |
Common
| Value | Count | Frequency (%) |
| 3155526 | ||
| , | 1669519 | |
| - | 42077 | 0.9% |
| . | 13671 | 0.3% |
| ( | 10381 | 0.2% |
| ) | 10376 | 0.2% |
| ' | 2228 | < 0.1% |
| [ | 676 | < 0.1% |
| ] | 676 | < 0.1% |
| = | 389 | < 0.1% |
| Other values (7) | 377 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28276056 | |
| None | 26786 | 0.1% |
| Punctuation | 239 | < 0.1% |
| Latin Ext Additional | 50 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3155526 | 11.2% | |
| a | 2668328 | 9.4% |
| i | 2176293 | 7.7% |
| e | 2119779 | 7.5% |
| t | 1973350 | 7.0% |
| r | 1844062 | 6.5% |
| , | 1669519 | 5.9% |
| n | 1511861 | 5.3% |
| o | 1298349 | 4.6% |
| s | 1011828 | 3.6% |
| Other values (57) | 8847161 |
None
| Value | Count | Frequency (%) |
| é | 6953 | |
| á | 5925 | |
| ã | 4537 | |
| í | 4305 | |
| ó | 3223 | |
| ô | 1182 | 4.4% |
| ñ | 439 | 1.6% |
| â | 51 | 0.2% |
| Đ | 50 | 0.2% |
| ı | 48 | 0.2% |
| Other values (3) | 73 | 0.3% |
Punctuation
| Value | Count | Frequency (%) |
| – | 239 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 50 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 2 |
continent
Text
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4673 |
| Missing (%) | 0.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 13 |
| Mean length | 12.48592475 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Oceania |
|---|---|
| 2nd row | North America |
| 3rd row | Oceania, Pacific Ocean |
| 4th row | North America |
| 5th row | North America |
| Value | Count | Frequency (%) |
| america | 483251 | |
| north | 418511 | |
| south | 64740 | 5.8% |
| asia | 39303 | 3.5% |
| oceania | 32002 | 2.9% |
| ocean | 28207 | 2.5% |
| pacific | 26665 | 2.4% |
| africa | 20689 | 1.8% |
| europe | 2403 | 0.2% |
| australia | 1401 | 0.1% |
| Other values (2) | 1542 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 926255 | |
| a | 666463 | |
| i | 631518 | |
| c | 617823 | |
| e | 545863 | |
| A | 544988 | |
| 539186 | ||
| o | 485654 | 6.7% |
| t | 485340 | 6.7% |
| h | 483251 | 6.7% |
| Other values (15) | 1309602 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5550315 | |
| Uppercase Letter | 1118714 | 15.5% |
| Space Separator | 539186 | 7.5% |
| Other Punctuation | 27728 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 926255 | |
| a | 666463 | |
| i | 631518 | |
| c | 617823 | |
| e | 545863 | |
| o | 485654 | |
| t | 485340 | |
| h | 483251 | |
| m | 483251 | |
| u | 68544 | 1.2% |
| Other values (6) | 156353 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 544988 | |
| N | 418511 | |
| S | 64740 | 5.8% |
| O | 60209 | 5.4% |
| P | 26665 | 2.4% |
| E | 2403 | 0.2% |
| I | 1198 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 539186 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 27728 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6669029 | |
| Common | 566914 | 7.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 926255 | |
| a | 666463 | |
| i | 631518 | |
| c | 617823 | |
| e | 545863 | |
| A | 544988 | |
| o | 485654 | |
| t | 485340 | |
| h | 483251 | |
| m | 483251 | |
| Other values (13) | 798623 |
Common
| Value | Count | Frequency (%) |
| 539186 | ||
| , | 27728 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7235943 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 926255 | |
| a | 666463 | |
| i | 631518 | |
| c | 617823 | |
| e | 545863 | |
| A | 544988 | |
| 539186 | ||
| o | 485654 | 6.7% |
| t | 485340 | 6.7% |
| h | 483251 | 6.7% |
| Other values (15) | 1309602 |
waterBody
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 555994 |
| Missing (%) | 95.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 12.96972383 |
| Min length | 12 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Pacific Ocean |
|---|---|
| 2nd row | Pacific Ocean |
| 3rd row | Pacific Ocean |
| 4th row | Pacific Ocean |
| 5th row | Indian Ocean |
| Value | Count | Frequency (%) |
| ocean | 28207 | |
| pacific | 26665 | |
| indian | 1198 | 2.1% |
| atlantic | 344 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 81881 | |
| a | 56414 | |
| i | 54872 | |
| n | 30947 | 8.5% |
| 28207 | 7.7% | |
| O | 28207 | 7.7% |
| e | 28207 | 7.7% |
| P | 26665 | 7.3% |
| f | 26665 | 7.3% |
| I | 1198 | 0.3% |
| Other values (4) | 2574 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 281216 | |
| Uppercase Letter | 56414 | 15.4% |
| Space Separator | 28207 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 81881 | |
| a | 56414 | |
| i | 54872 | |
| n | 30947 | 11.0% |
| e | 28207 | 10.0% |
| f | 26665 | 9.5% |
| d | 1198 | 0.4% |
| t | 688 | 0.2% |
| l | 344 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 28207 | |
| P | 26665 | |
| I | 1198 | 2.1% |
| A | 344 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 28207 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 337630 | |
| Common | 28207 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 81881 | |
| a | 56414 | |
| i | 54872 | |
| n | 30947 | 9.2% |
| O | 28207 | 8.4% |
| e | 28207 | 8.4% |
| P | 26665 | 7.9% |
| f | 26665 | 7.9% |
| I | 1198 | 0.4% |
| d | 1198 | 0.4% |
| Other values (3) | 1376 | 0.4% |
Common
| Value | Count | Frequency (%) |
| 28207 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 365837 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 81881 | |
| a | 56414 | |
| i | 54872 | |
| n | 30947 | 8.5% |
| 28207 | 7.7% | |
| O | 28207 | 7.7% |
| e | 28207 | 7.7% |
| P | 26665 | 7.3% |
| f | 26665 | 7.3% |
| I | 1198 | 0.3% |
| Other values (4) | 2574 | 0.7% |
islandGroup
Text
Missing 
| Distinct | 41 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 564324 |
| Missing (%) | 96.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 25 |
| Mean length | 13.3327967 |
| Min length | 10 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Windward Islands |
|---|---|
| 2nd row | Virgin Islands |
| 3rd row | Hispaniola |
| 4th row | Hispaniola |
| 5th row | Greater Sunda Islands |
| Value | Count | Frequency (%) |
| islands | 10225 | |
| hispaniola | 8927 | |
| virgin | 2527 | 7.7% |
| windward | 2377 | 7.2% |
| bahama | 1504 | 4.6% |
| leeward | 1357 | 4.1% |
| sunda | 1019 | 3.1% |
| greater | 1018 | 3.1% |
| northern | 671 | 2.0% |
| solomon | 655 | 2.0% |
| Other values (48) | 2663 | 8.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 41073 | |
| s | 30081 | |
| n | 27407 | |
| i | 26949 | |
| l | 20663 | 7.8% |
| d | 17902 | 6.8% |
| 13066 | 4.9% | |
| o | 12195 | 4.6% |
| r | 10747 | 4.1% |
| I | 10283 | 3.9% |
| Other values (35) | 54650 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 218943 | |
| Uppercase Letter | 32978 | 12.4% |
| Space Separator | 13066 | 4.9% |
| Open Punctuation | 8 | < 0.1% |
| Math Symbol | 8 | < 0.1% |
| Close Punctuation | 8 | < 0.1% |
| Other Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 41073 | |
| s | 30081 | |
| n | 27407 | |
| i | 26949 | |
| l | 20663 | |
| d | 17902 | |
| o | 12195 | 5.6% |
| r | 10747 | 4.9% |
| p | 9142 | 4.2% |
| e | 5828 | 2.7% |
| Other values (13) | 16956 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 10283 | |
| H | 8927 | |
| V | 2533 | 7.7% |
| W | 2377 | 7.2% |
| S | 1736 | 5.3% |
| B | 1647 | 5.0% |
| L | 1372 | 4.2% |
| G | 1061 | 3.2% |
| C | 934 | 2.8% |
| N | 775 | 2.4% |
| Other values (7) | 1333 | 4.0% |
Space Separator
| Value | Count | Frequency (%) |
| 13066 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 8 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 251921 | |
| Common | 13095 | 4.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 41073 | |
| s | 30081 | |
| n | 27407 | |
| i | 26949 | |
| l | 20663 | |
| d | 17902 | 7.1% |
| o | 12195 | 4.8% |
| r | 10747 | 4.3% |
| I | 10283 | 4.1% |
| p | 9142 | 3.6% |
| Other values (30) | 45479 |
Common
| Value | Count | Frequency (%) |
| 13066 | ||
| ( | 8 | 0.1% |
| = | 8 | 0.1% |
| ) | 8 | 0.1% |
| . | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 265016 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 41073 | |
| s | 30081 | |
| n | 27407 | |
| i | 26949 | |
| l | 20663 | 7.8% |
| d | 17902 | 6.8% |
| 13066 | 4.9% | |
| o | 12195 | 4.6% |
| r | 10747 | 4.1% |
| I | 10283 | 3.9% |
| Other values (35) | 54650 |
island
Text
Missing 
| Distinct | 39 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 576136 |
| Missing (%) | 98.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 10 |
| Mean length | 10.77445753 |
| Min length | 6 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | New Guinea |
|---|---|
| 2nd row | Grenada Island |
| 3rd row | New Guinea |
| 4th row | New Guinea |
| 5th row | Little Swan Island |
| Value | Count | Frequency (%) |
| new | 4350 | |
| guinea | 4350 | |
| island | 1306 | 8.7% |
| borneo | 712 | 4.7% |
| bougainville | 652 | 4.3% |
| sumatra | 558 | 3.7% |
| okinawa | 493 | 3.3% |
| grenada | 267 | 1.8% |
| isla | 258 | 1.7% |
| swan | 241 | 1.6% |
| Other values (44) | 1803 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 11374 | |
| a | 10388 | |
| n | 8928 | |
| 6925 | 8.0% | |
| i | 6731 | 7.7% |
| u | 5716 | 6.6% |
| w | 5086 | 5.9% |
| G | 4959 | 5.7% |
| N | 4459 | 5.1% |
| l | 3060 | 3.5% |
| Other values (34) | 19270 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65206 | |
| Uppercase Letter | 14765 | 17.0% |
| Space Separator | 6925 | 8.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11374 | |
| a | 10388 | |
| n | 8928 | |
| i | 6731 | |
| u | 5716 | |
| w | 5086 | |
| l | 3060 | 4.7% |
| o | 2768 | 4.2% |
| d | 2350 | 3.6% |
| s | 2071 | 3.2% |
| Other values (14) | 6734 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 4959 | |
| N | 4459 | |
| I | 1683 | 11.4% |
| B | 1407 | 9.5% |
| S | 841 | 5.7% |
| O | 512 | 3.5% |
| U | 199 | 1.3% |
| K | 190 | 1.3% |
| L | 178 | 1.2% |
| R | 151 | 1.0% |
| Other values (9) | 186 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 6925 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 79971 | |
| Common | 6925 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11374 | |
| a | 10388 | |
| n | 8928 | |
| i | 6731 | |
| u | 5716 | 7.1% |
| w | 5086 | 6.4% |
| G | 4959 | 6.2% |
| N | 4459 | 5.6% |
| l | 3060 | 3.8% |
| o | 2768 | 3.5% |
| Other values (33) | 16502 |
Common
| Value | Count | Frequency (%) |
| 6925 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 86748 | |
| None | 148 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 11374 | |
| a | 10388 | |
| n | 8928 | |
| 6925 | 8.0% | |
| i | 6731 | 7.8% |
| u | 5716 | 6.6% |
| w | 5086 | 5.9% |
| G | 4959 | 5.7% |
| N | 4459 | 5.1% |
| l | 3060 | 3.5% |
| Other values (33) | 19122 |
None
| Value | Count | Frequency (%) |
| á | 148 |
country
Text
| Distinct | 235 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5014 |
| Missing (%) | 0.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 13 |
| Mean length | 11.36707143 |
| Min length | 4 |
Unique
| Unique | 16 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Papua New Guinea |
|---|---|
| 2nd row | United States |
| 3rd row | Tonga |
| 4th row | Grenada |
| 5th row | United States |
| Value | Count | Frequency (%) |
| states | 351011 | |
| united | 349162 | |
| mexico | 22872 | 2.3% |
| ecuador | 16235 | 1.6% |
| brazil | 14751 | 1.5% |
| territory | 13632 | 1.4% |
| peru | 12875 | 1.3% |
| philippines | 11392 | 1.1% |
| honduras | 10938 | 1.1% |
| panama | 7692 | 0.8% |
| Other values (255) | 187970 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1097918 | |
| e | 826989 | |
| a | 632853 | |
| i | 556170 | |
| n | 472266 | |
| 419343 | 6.4% | |
| s | 408842 | 6.2% |
| d | 407821 | 6.2% |
| S | 361495 | 5.5% |
| U | 349907 | 5.3% |
| Other values (51) | 1050056 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5166158 | |
| Uppercase Letter | 990793 | 15.0% |
| Space Separator | 419343 | 6.4% |
| Other Punctuation | 4735 | 0.1% |
| Open Punctuation | 1221 | < 0.1% |
| Close Punctuation | 1221 | < 0.1% |
| Dash Punctuation | 189 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1097918 | |
| e | 826989 | |
| a | 632853 | |
| i | 556170 | |
| n | 472266 | |
| s | 408842 | 7.9% |
| d | 407821 | 7.9% |
| r | 137195 | 2.7% |
| o | 125533 | 2.4% |
| u | 92494 | 1.8% |
| Other values (19) | 408077 | 7.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 361495 | |
| U | 349907 | |
| P | 45641 | 4.6% |
| M | 31265 | 3.2% |
| T | 27087 | 2.7% |
| B | 26397 | 2.7% |
| C | 25077 | 2.5% |
| E | 19880 | 2.0% |
| H | 14756 | 1.5% |
| G | 12513 | 1.3% |
| Other values (14) | 76775 | 7.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3713 | |
| . | 1022 | 21.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 676 | |
| ( | 545 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 676 | |
| ) | 545 |
Space Separator
| Value | Count | Frequency (%) |
| 419343 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 189 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6156951 | |
| Common | 426709 | 6.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1097918 | |
| e | 826989 | |
| a | 632853 | |
| i | 556170 | |
| n | 472266 | |
| s | 408842 | 6.6% |
| d | 407821 | 6.6% |
| S | 361495 | 5.9% |
| U | 349907 | 5.7% |
| r | 137195 | 2.2% |
| Other values (43) | 905495 |
Common
| Value | Count | Frequency (%) |
| 419343 | ||
| , | 3713 | 0.9% |
| . | 1022 | 0.2% |
| [ | 676 | 0.2% |
| ] | 676 | 0.2% |
| ( | 545 | 0.1% |
| ) | 545 | 0.1% |
| - | 189 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6581073 | |
| None | 2587 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1097918 | |
| e | 826989 | |
| a | 632853 | |
| i | 556170 | |
| n | 472266 | |
| 419343 | 6.4% | |
| s | 408842 | 6.2% |
| d | 407821 | 6.2% |
| S | 361495 | 5.5% |
| U | 349907 | 5.3% |
| Other values (48) | 1047469 |
None
| Value | Count | Frequency (%) |
| é | 893 | |
| í | 847 | |
| ã | 847 |
stateProvince
Text
Missing 
| Distinct | 2059 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 17001 |
| Missing (%) | 2.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 52 |
| Mean length | 10.58665021 |
| Min length | 3 |
Unique
| Unique | 356 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Central Province |
|---|---|
| 2nd row | North Carolina |
| 3rd row | Tonga Islands |
| 4th row | St. George Parish |
| 5th row | Virginia |
| Value | Count | Frequency (%) |
| virginia | 93314 | 11.0% |
| carolina | 61709 | 7.2% |
| north | 57614 | 6.8% |
| maryland | 32649 | 3.8% |
| province | 27443 | 3.2% |
| pennsylvania | 18911 | 2.2% |
| west | 18140 | 2.1% |
| florida | 18100 | 2.1% |
| island | 18015 | 2.1% |
| tennessee | 17444 | 2.0% |
| Other values (1937) | 487863 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 826291 | |
| i | 632216 | 10.5% |
| n | 557794 | 9.3% |
| r | 474453 | 7.9% |
| o | 407390 | 6.8% |
| e | 304504 | 5.1% |
| 284002 | 4.7% | |
| l | 264922 | 4.4% |
| s | 256173 | 4.3% |
| t | 191100 | 3.2% |
| Other values (62) | 1805903 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4862616 | |
| Uppercase Letter | 830502 | 13.8% |
| Space Separator | 284002 | 4.7% |
| Dash Punctuation | 16262 | 0.3% |
| Other Punctuation | 9979 | 0.2% |
| Open Punctuation | 537 | < 0.1% |
| Close Punctuation | 532 | < 0.1% |
| Math Symbol | 318 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 826291 | |
| i | 632216 | |
| n | 557794 | |
| r | 474453 | |
| o | 407390 | |
| e | 304504 | 6.3% |
| l | 264922 | 5.4% |
| s | 256173 | 5.3% |
| t | 191100 | 3.9% |
| g | 142256 | 2.9% |
| Other values (24) | 805517 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 108499 | |
| V | 99187 | |
| P | 90787 | |
| N | 84504 | |
| M | 71209 | |
| I | 44333 | 5.3% |
| S | 44167 | 5.3% |
| T | 42704 | 5.1% |
| G | 35681 | 4.3% |
| A | 34622 | 4.2% |
| Other values (17) | 174809 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9193 | |
| ' | 757 | 7.6% |
| ? | 19 | 0.2% |
| / | 6 | 0.1% |
| , | 4 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 298 | |
| + | 20 | 6.3% |
Space Separator
| Value | Count | Frequency (%) |
| 284002 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16262 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 537 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 532 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5693118 | |
| Common | 311630 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 826291 | |
| i | 632216 | 11.1% |
| n | 557794 | 9.8% |
| r | 474453 | 8.3% |
| o | 407390 | 7.2% |
| e | 304504 | 5.3% |
| l | 264922 | 4.7% |
| s | 256173 | 4.5% |
| t | 191100 | 3.4% |
| g | 142256 | 2.5% |
| Other values (51) | 1636019 |
Common
| Value | Count | Frequency (%) |
| 284002 | ||
| - | 16262 | 5.2% |
| . | 9193 | 2.9% |
| ' | 757 | 0.2% |
| ( | 537 | 0.2% |
| ) | 532 | 0.2% |
| = | 298 | 0.1% |
| + | 20 | < 0.1% |
| ? | 19 | < 0.1% |
| / | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5984875 | |
| None | 19873 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 826291 | |
| i | 632216 | 10.6% |
| n | 557794 | 9.3% |
| r | 474453 | 7.9% |
| o | 407390 | 6.8% |
| e | 304504 | 5.1% |
| 284002 | 4.7% | |
| l | 264922 | 4.4% |
| s | 256173 | 4.3% |
| t | 191100 | 3.2% |
| Other values (53) | 1786030 |
None
| Value | Count | Frequency (%) |
| á | 4907 | |
| é | 4585 | |
| ã | 3690 | |
| ó | 2908 | |
| í | 2325 | |
| ô | 1036 | 5.2% |
| ñ | 367 | 1.8% |
| ı | 48 | 0.2% |
| Î | 7 | < 0.1% |
county
Text
Missing 
| Distinct | 3056 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 191557 |
| Missing (%) | 32.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 43 |
| Mean length | 9.394395432 |
| Min length | 3 |
Unique
| Unique | 504 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Kairuku-Hiri District |
|---|---|
| 2nd row | Buncombe - Yancey |
| 3rd row | Tongatapu Island Group |
| 4th row | Augusta |
| 5th row | Elko |
| Value | Count | Frequency (%) |
| 21119 | 3.8% | |
| island | 14180 | 2.6% |
| swain | 12742 | 2.3% |
| city | 8568 | 1.6% |
| province | 8458 | 1.5% |
| giles | 8024 | 1.5% |
| frederick | 7508 | 1.4% |
| macon | 7377 | 1.3% |
| municipality | 7367 | 1.3% |
| haywood | 7297 | 1.3% |
| Other values (2826) | 448585 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 361375 | 9.8% |
| e | 318401 | 8.6% |
| n | 281913 | 7.6% |
| o | 250126 | 6.8% |
| i | 237836 | 6.4% |
| r | 221961 | 6.0% |
| l | 181195 | 4.9% |
| 158581 | 4.3% | |
| s | 154891 | 4.2% |
| t | 142082 | 3.9% |
| Other values (64) | 1380292 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2956891 | |
| Uppercase Letter | 526246 | 14.3% |
| Space Separator | 158581 | 4.3% |
| Dash Punctuation | 25865 | 0.7% |
| Close Punctuation | 7839 | 0.2% |
| Open Punctuation | 7839 | 0.2% |
| Other Punctuation | 5243 | 0.1% |
| Math Symbol | 83 | < 0.1% |
| Decimal Number | 64 | < 0.1% |
| Modifier Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 361375 | |
| e | 318401 | |
| n | 281913 | |
| o | 250126 | 8.5% |
| i | 237836 | 8.0% |
| r | 221961 | 7.5% |
| l | 181195 | 6.1% |
| s | 154891 | 5.2% |
| t | 142082 | 4.8% |
| c | 111847 | 3.8% |
| Other values (25) | 695264 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 56403 | 10.7% |
| S | 49852 | 9.5% |
| C | 48154 | 9.2% |
| P | 46114 | 8.8% |
| G | 36649 | 7.0% |
| B | 29767 | 5.7% |
| I | 27305 | 5.2% |
| A | 26724 | 5.1% |
| H | 24796 | 4.7% |
| R | 21003 | 4.0% |
| Other values (15) | 159479 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3451 | |
| ' | 1471 | |
| , | 294 | 5.6% |
| ? | 22 | 0.4% |
| / | 5 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25626 | |
| – | 239 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 32 | |
| 0 | 32 |
Space Separator
| Value | Count | Frequency (%) |
| 158581 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 7839 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 7839 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 83 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3483137 | |
| Common | 205516 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 361375 | 10.4% |
| e | 318401 | 9.1% |
| n | 281913 | 8.1% |
| o | 250126 | 7.2% |
| i | 237836 | 6.8% |
| r | 221961 | 6.4% |
| l | 181195 | 5.2% |
| s | 154891 | 4.4% |
| t | 142082 | 4.1% |
| c | 111847 | 3.2% |
| Other values (50) | 1221510 |
Common
| Value | Count | Frequency (%) |
| 158581 | ||
| - | 25626 | 12.5% |
| ) | 7839 | 3.8% |
| ( | 7839 | 3.8% |
| . | 3451 | 1.7% |
| ' | 1471 | 0.7% |
| , | 294 | 0.1% |
| – | 239 | 0.1% |
| = | 83 | < 0.1% |
| 1 | 32 | < 0.1% |
| Other values (4) | 61 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3684587 | |
| None | 3825 | 0.1% |
| Punctuation | 239 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 361375 | 9.8% |
| e | 318401 | 8.6% |
| n | 281913 | 7.7% |
| o | 250126 | 6.8% |
| i | 237836 | 6.5% |
| r | 221961 | 6.0% |
| l | 181195 | 4.9% |
| 158581 | 4.3% | |
| s | 154891 | 4.2% |
| t | 142082 | 3.9% |
| Other values (53) | 1376226 |
None
| Value | Count | Frequency (%) |
| é | 1444 | |
| í | 911 | |
| á | 870 | |
| ó | 315 | 8.2% |
| ô | 96 | 2.5% |
| ñ | 72 | 1.9% |
| â | 51 | 1.3% |
| ü | 38 | 1.0% |
| è | 28 | 0.7% |
Punctuation
| Value | Count | Frequency (%) |
| – | 239 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 2 |
locality
Text
| Distinct | 56650 |
|---|---|
| Distinct (%) | 9.7% |
| Missing | 2303 |
| Missing (%) | 0.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 295 |
|---|---|
| Median length | 193 |
| Mean length | 54.40064066 |
| Min length | 2 |
Unique
| Unique | 25059 ? |
|---|---|
| Unique (%) | 4.3% |
Sample
| 1st row | Kairuku, Yule Island |
|---|---|
| 2nd row | Pisgah National Forest, near Cane River Gap |
| 3rd row | No Locality Data |
| 4th row | Tongatapu Island, adjacent to Fua'amotu Airport |
| 5th row | Grand Anse Bay, west end of, along road to jetty just east of base of Quarantine Point |
| Value | Count | Frequency (%) |
| of | 456712 | 8.0% |
| mi | 190409 | 3.3% |
| road | 182915 | 3.2% |
| route | 156226 | 2.7% |
| on | 147202 | 2.6% |
| national | 106083 | 1.8% |
| by | 93415 | 1.6% |
| forest | 89661 | 1.6% |
| junction | 81776 | 1.4% |
| km | 68711 | 1.2% |
| Other values (30771) | 4165761 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5156973 | ||
| a | 2379871 | 7.5% |
| o | 2379316 | 7.5% |
| e | 1741456 | 5.5% |
| n | 1661294 | 5.2% |
| i | 1563818 | 4.9% |
| t | 1519591 | 4.8% |
| r | 1285474 | 4.1% |
| l | 960809 | 3.0% |
| , | 845140 | 2.7% |
| Other values (100) | 12161882 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19579495 | |
| Space Separator | 5156973 | 16.3% |
| Uppercase Letter | 4046777 | 12.8% |
| Other Punctuation | 1240931 | 3.9% |
| Decimal Number | 1169470 | 3.7% |
| Open Punctuation | 200092 | 0.6% |
| Close Punctuation | 200069 | 0.6% |
| Dash Punctuation | 36149 | 0.1% |
| Math Symbol | 25534 | 0.1% |
| Format | 126 | < 0.1% |
| Other values (2) | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2379871 | |
| o | 2379316 | |
| e | 1741456 | 8.9% |
| n | 1661294 | 8.5% |
| i | 1563818 | 8.0% |
| t | 1519591 | 7.8% |
| r | 1285474 | 6.6% |
| l | 960809 | 4.9% |
| u | 734887 | 3.8% |
| s | 723098 | 3.7% |
| Other values (38) | 4629881 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 450616 | 11.1% |
| S | 430589 | 10.6% |
| N | 383011 | 9.5% |
| C | 258686 | 6.4% |
| M | 234251 | 5.8% |
| E | 231690 | 5.7% |
| W | 220124 | 5.4% |
| P | 210033 | 5.2% |
| A | 197529 | 4.9% |
| F | 190404 | 4.7% |
| Other values (18) | 1239844 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 845140 | |
| . | 367701 | |
| ' | 12150 | 1.0% |
| ; | 6267 | 0.5% |
| / | 6172 | 0.5% |
| " | 1760 | 0.1% |
| : | 693 | 0.1% |
| ? | 661 | 0.1% |
| # | 351 | < 0.1% |
| & | 36 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 222198 | |
| 0 | 175576 | |
| 2 | 157851 | |
| 5 | 124586 | |
| 3 | 116675 | |
| 6 | 105408 | |
| 4 | 93349 | |
| 7 | 69091 | 5.9% |
| 8 | 55301 | 4.7% |
| 9 | 49435 | 4.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 200029 | |
| [ | 62 | < 0.1% |
| ‚ | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 24365 | |
| + | 1165 | 4.6% |
| < | 4 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 200007 | |
| ] | 62 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 36140 | |
| – | 9 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 5156973 |
Format
| Value | Count | Frequency (%) |
| | 126 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23626272 | |
| Common | 8029352 | 25.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2379871 | 10.1% |
| o | 2379316 | 10.1% |
| e | 1741456 | 7.4% |
| n | 1661294 | 7.0% |
| i | 1563818 | 6.6% |
| t | 1519591 | 6.4% |
| r | 1285474 | 5.4% |
| l | 960809 | 4.1% |
| u | 734887 | 3.1% |
| s | 723098 | 3.1% |
| Other values (66) | 8676658 |
Common
| Value | Count | Frequency (%) |
| 5156973 | ||
| , | 845140 | 10.5% |
| . | 367701 | 4.6% |
| 1 | 222198 | 2.8% |
| ( | 200029 | 2.5% |
| ) | 200007 | 2.5% |
| 0 | 175576 | 2.2% |
| 2 | 157851 | 2.0% |
| 5 | 124586 | 1.6% |
| 3 | 116675 | 1.5% |
| Other values (24) | 462616 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31626455 | |
| None | 29146 | 0.1% |
| Latin Ext Additional | 13 | < 0.1% |
| Punctuation | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5156973 | ||
| a | 2379871 | 7.5% |
| o | 2379316 | 7.5% |
| e | 1741456 | 5.5% |
| n | 1661294 | 5.3% |
| i | 1563818 | 4.9% |
| t | 1519591 | 4.8% |
| r | 1285474 | 4.1% |
| l | 960809 | 3.0% |
| , | 845140 | 2.7% |
| Other values (73) | 12132713 |
None
| Value | Count | Frequency (%) |
| í | 24109 | |
| é | 1678 | 5.8% |
| á | 1098 | 3.8% |
| ñ | 788 | 2.7% |
| â | 452 | 1.6% |
| ó | 240 | 0.8% |
| ú | 196 | 0.7% |
| ô | 169 | 0.6% |
| | 126 | 0.4% |
| è | 59 | 0.2% |
| Other values (12) | 231 | 0.8% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ấ | 9 | |
| ạ | 2 | 15.4% |
| ể | 2 | 15.4% |
Punctuation
| Value | Count | Frequency (%) |
| – | 9 | |
| ‚ | 1 | 10.0% |
Missing 
| Distinct | 1393 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 332173 |
| Missing (%) | 56.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.17383386 |
| Min length | 3 |
Unique
| Unique | 172 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 1317.0 |
|---|---|
| 2nd row | 1326.0 |
| 3rd row | 2200.0 |
| 4th row | 30.0 |
| 5th row | 9.0 |
| Value | Count | Frequency (%) |
| 335.0 | 5696 | 2.3% |
| 1067.0 | 4475 | 1.8% |
| 200.0 | 3544 | 1.4% |
| 1036.0 | 3021 | 1.2% |
| 91.0 | 2873 | 1.1% |
| 3.0 | 2426 | 1.0% |
| 280.0 | 2242 | 0.9% |
| 6.0 | 2185 | 0.9% |
| 320.0 | 2140 | 0.8% |
| 30.0 | 2121 | 0.8% |
| Other values (1380) | 221305 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 367971 | |
| . | 252028 | |
| 1 | 174955 | |
| 2 | 79232 | 6.1% |
| 3 | 77185 | 5.9% |
| 5 | 67603 | 5.2% |
| 4 | 64449 | 4.9% |
| 6 | 61727 | 4.7% |
| 9 | 55836 | 4.3% |
| 7 | 54079 | 4.1% |
| Other values (2) | 48886 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1051918 | |
| Other Punctuation | 252028 | 19.3% |
| Dash Punctuation | 5 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 367971 | |
| 1 | 174955 | |
| 2 | 79232 | 7.5% |
| 3 | 77185 | 7.3% |
| 5 | 67603 | 6.4% |
| 4 | 64449 | 6.1% |
| 6 | 61727 | 5.9% |
| 9 | 55836 | 5.3% |
| 7 | 54079 | 5.1% |
| 8 | 48881 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 252028 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1303951 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 367971 | |
| . | 252028 | |
| 1 | 174955 | |
| 2 | 79232 | 6.1% |
| 3 | 77185 | 5.9% |
| 5 | 67603 | 5.2% |
| 4 | 64449 | 4.9% |
| 6 | 61727 | 4.7% |
| 9 | 55836 | 4.3% |
| 7 | 54079 | 4.1% |
| Other values (2) | 48886 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1303951 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 367971 | |
| . | 252028 | |
| 1 | 174955 | |
| 2 | 79232 | 6.1% |
| 3 | 77185 | 5.9% |
| 5 | 67603 | 5.2% |
| 4 | 64449 | 4.9% |
| 6 | 61727 | 4.7% |
| 9 | 55836 | 4.3% |
| 7 | 54079 | 4.1% |
| Other values (2) | 48886 | 3.7% |
Missing 
| Distinct | 1386 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 333225 |
| Missing (%) | 57.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.186667251 |
| Min length | 3 |
Unique
| Unique | 173 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 1317.0 |
|---|---|
| 2nd row | 1326.0 |
| 3rd row | 2200.0 |
| 4th row | 50.0 |
| 5th row | 9.0 |
| Value | Count | Frequency (%) |
| 411.0 | 5198 | 2.1% |
| 1067.0 | 4371 | 1.7% |
| 1036.0 | 3919 | 1.6% |
| 200.0 | 2888 | 1.2% |
| 1146.0 | 2811 | 1.1% |
| 975.0 | 2590 | 1.0% |
| 280.0 | 2519 | 1.0% |
| 3.0 | 2326 | 0.9% |
| 6.0 | 2222 | 0.9% |
| 1189.0 | 2174 | 0.9% |
| Other values (1373) | 219958 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 362944 | |
| . | 250976 | |
| 1 | 183815 | |
| 2 | 79825 | 6.1% |
| 3 | 69914 | 5.4% |
| 4 | 67822 | 5.2% |
| 6 | 62985 | 4.8% |
| 5 | 62697 | 4.8% |
| 7 | 57607 | 4.4% |
| 9 | 54462 | 4.2% |
| Other values (2) | 48682 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1050748 | |
| Other Punctuation | 250976 | 19.3% |
| Dash Punctuation | 5 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 362944 | |
| 1 | 183815 | |
| 2 | 79825 | 7.6% |
| 3 | 69914 | 6.7% |
| 4 | 67822 | 6.5% |
| 6 | 62985 | 6.0% |
| 5 | 62697 | 6.0% |
| 7 | 57607 | 5.5% |
| 9 | 54462 | 5.2% |
| 8 | 48677 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 250976 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1301729 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 362944 | |
| . | 250976 | |
| 1 | 183815 | |
| 2 | 79825 | 6.1% |
| 3 | 69914 | 5.4% |
| 4 | 67822 | 5.2% |
| 6 | 62985 | 4.8% |
| 5 | 62697 | 4.8% |
| 7 | 57607 | 4.4% |
| 9 | 54462 | 4.2% |
| Other values (2) | 48682 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1301729 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 362944 | |
| . | 250976 | |
| 1 | 183815 | |
| 2 | 79825 | 6.1% |
| 3 | 69914 | 5.4% |
| 4 | 67822 | 5.2% |
| 6 | 62985 | 4.8% |
| 5 | 62697 | 4.8% |
| 7 | 57607 | 4.4% |
| 9 | 54462 | 4.2% |
| Other values (2) | 48682 | 3.7% |
Missing 
| Distinct | 2882 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 331608 |
| Missing (%) | 56.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 93 |
|---|---|
| Median length | 46 |
| Mean length | 7.093015246 |
| Min length | 3 |
Unique
| Unique | 530 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 4320 ft |
|---|---|
| 2nd row | 4351 ft |
| 3rd row | 2200 m |
| 4th row | 30-50 m |
| 5th row | 30 ft |
| Value | Count | Frequency (%) |
| ft | 191831 | |
| m | 59860 | 11.5% |
| ca | 13358 | 2.6% |
| 1100-1350 | 4058 | 0.8% |
| 200 | 3781 | 0.7% |
| 10 | 3450 | 0.7% |
| 3400 | 2848 | 0.5% |
| 3500 | 2819 | 0.5% |
| 20 | 2706 | 0.5% |
| 3600 | 2513 | 0.5% |
| Other values (2009) | 234300 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 376273 | |
| 268931 | ||
| t | 192412 | |
| f | 192004 | |
| 1 | 99566 | 5.6% |
| 3 | 96808 | 5.4% |
| 2 | 90988 | 5.1% |
| 4 | 83319 | 4.7% |
| 5 | 76675 | 4.3% |
| m | 59946 | 3.3% |
| Other values (47) | 254724 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 994929 | |
| Lowercase Letter | 481690 | |
| Space Separator | 268931 | 15.0% |
| Dash Punctuation | 30052 | 1.7% |
| Other Punctuation | 13757 | 0.8% |
| Close Punctuation | 1006 | 0.1% |
| Open Punctuation | 1006 | 0.1% |
| Math Symbol | 195 | < 0.1% |
| Uppercase Letter | 80 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 192412 | |
| f | 192004 | |
| m | 59946 | 12.4% |
| a | 14859 | 3.1% |
| c | 13366 | 2.8% |
| e | 3277 | 0.7% |
| l | 1590 | 0.3% |
| v | 1058 | 0.2% |
| s | 835 | 0.2% |
| o | 611 | 0.1% |
| Other values (15) | 1732 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 376273 | |
| 1 | 99566 | 10.0% |
| 3 | 96808 | 9.7% |
| 2 | 90988 | 9.1% |
| 4 | 83319 | 8.4% |
| 5 | 76675 | 7.7% |
| 6 | 59540 | 6.0% |
| 8 | 45372 | 4.6% |
| 7 | 38030 | 3.8% |
| 9 | 28358 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 23 | |
| S | 15 | |
| P | 12 | |
| G | 12 | |
| A | 10 | |
| D | 5 | 6.2% |
| L | 2 | 2.5% |
| M | 1 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 13576 | |
| , | 90 | 0.7% |
| / | 39 | 0.3% |
| ; | 22 | 0.2% |
| ? | 22 | 0.2% |
| ' | 6 | < 0.1% |
| ‡ | 2 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 110 | |
| + | 75 | |
| = | 10 | 5.1% |
Space Separator
| Value | Count | Frequency (%) |
| 268931 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30052 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1006 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1006 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1309876 | |
| Latin | 481770 | 26.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 192412 | |
| f | 192004 | |
| m | 59946 | 12.4% |
| a | 14859 | 3.1% |
| c | 13366 | 2.8% |
| e | 3277 | 0.7% |
| l | 1590 | 0.3% |
| v | 1058 | 0.2% |
| s | 835 | 0.2% |
| o | 611 | 0.1% |
| Other values (23) | 1812 | 0.4% |
Common
| Value | Count | Frequency (%) |
| 0 | 376273 | |
| 268931 | ||
| 1 | 99566 | 7.6% |
| 3 | 96808 | 7.4% |
| 2 | 90988 | 6.9% |
| 4 | 83319 | 6.4% |
| 5 | 76675 | 5.9% |
| 6 | 59540 | 4.5% |
| 8 | 45372 | 3.5% |
| 7 | 38030 | 2.9% |
| Other values (14) | 74374 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1791644 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 376273 | |
| 268931 | ||
| t | 192412 | |
| f | 192004 | |
| 1 | 99566 | 5.6% |
| 3 | 96808 | 5.4% |
| 2 | 90988 | 5.1% |
| 4 | 83319 | 4.7% |
| 5 | 76675 | 4.3% |
| m | 59946 | 3.3% |
| Other values (46) | 254722 |
Punctuation
| Value | Count | Frequency (%) |
| ‡ | 2 |
decimalLatitude
Text
Missing 
| Distinct | 23890 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 162901 |
| Missing (%) | 27.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 6.648575837 |
| Min length | 3 |
Unique
| Unique | 7815 ? |
|---|---|
| Unique (%) | 1.9% |
Sample
| 1st row | -8.8201 |
|---|---|
| 2nd row | 35.8083 |
| 3rd row | 12.0217 |
| 4th row | 38.39 |
| 5th row | 40.9582 |
| Value | Count | Frequency (%) |
| 39.6306 | 4296 | 1.0% |
| 13.6389 | 2247 | 0.5% |
| 39.8872 | 1888 | 0.4% |
| 12.83 | 1754 | 0.4% |
| 26.9844 | 1718 | 0.4% |
| 4.0147 | 1664 | 0.4% |
| 37.4161 | 1535 | 0.4% |
| 36.7631 | 1511 | 0.4% |
| 25.4017 | 1483 | 0.4% |
| 36.9486 | 1468 | 0.3% |
| Other values (23415) | 401736 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 469957 | |
| . | 421300 | |
| 1 | 234920 | |
| 6 | 225682 | |
| 8 | 222143 | |
| 4 | 219369 | |
| 5 | 217631 | |
| 7 | 201596 | |
| 2 | 197526 | |
| 9 | 188366 | |
| Other values (3) | 202555 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2320649 | |
| Other Punctuation | 421300 | 15.0% |
| Dash Punctuation | 59033 | 2.1% |
| Uppercase Letter | 63 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 469957 | |
| 1 | 234920 | |
| 6 | 225682 | |
| 8 | 222143 | |
| 4 | 219369 | |
| 5 | 217631 | |
| 7 | 201596 | |
| 2 | 197526 | |
| 9 | 188366 | |
| 0 | 143459 | 6.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 421300 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 59033 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 63 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2800982 | |
| Latin | 63 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 469957 | |
| . | 421300 | |
| 1 | 234920 | |
| 6 | 225682 | |
| 8 | 222143 | |
| 4 | 219369 | |
| 5 | 217631 | |
| 7 | 201596 | |
| 2 | 197526 | |
| 9 | 188366 | |
| Other values (2) | 202492 |
Latin
| Value | Count | Frequency (%) |
| E | 63 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2801045 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 469957 | |
| . | 421300 | |
| 1 | 234920 | |
| 6 | 225682 | |
| 8 | 222143 | |
| 4 | 219369 | |
| 5 | 217631 | |
| 7 | 201596 | |
| 2 | 197526 | |
| 9 | 188366 | |
| Other values (3) | 202555 |
decimalLongitude
Text
Missing 
| Distinct | 24293 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 162901 |
| Missing (%) | 27.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.51880845 |
| Min length | 3 |
Unique
| Unique | 7784 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 146.53 |
|---|---|
| 2nd row | -82.3481 |
| 3rd row | -61.7664 |
| 4th row | -79.25 |
| 5th row | -115.434 |
| Value | Count | Frequency (%) |
| 77.4714 | 4296 | 1.0% |
| 144.962 | 2247 | 0.5% |
| 77.7786 | 2139 | 0.5% |
| 87.1889 | 1888 | 0.4% |
| 69.28 | 1763 | 0.4% |
| 81.4919 | 1718 | 0.4% |
| 80.5097 | 1653 | 0.4% |
| 81.2228 | 1509 | 0.4% |
| 80.6567 | 1483 | 0.4% |
| 79.5561 | 1463 | 0.3% |
| Other values (24157) | 401141 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 421300 | |
| - | 381965 | |
| 7 | 372583 | |
| 8 | 353308 | |
| 1 | 252479 | |
| 3 | 236540 | |
| 6 | 221739 | |
| 9 | 209505 | |
| 4 | 196861 | |
| 2 | 192014 | |
| Other values (2) | 329380 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2364409 | |
| Other Punctuation | 421300 | 13.3% |
| Dash Punctuation | 381965 | 12.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 372583 | |
| 8 | 353308 | |
| 1 | 252479 | |
| 3 | 236540 | |
| 6 | 221739 | |
| 9 | 209505 | |
| 4 | 196861 | |
| 2 | 192014 | |
| 5 | 191410 | |
| 0 | 137970 | 5.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 421300 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 381965 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3167674 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 421300 | |
| - | 381965 | |
| 7 | 372583 | |
| 8 | 353308 | |
| 1 | 252479 | |
| 3 | 236540 | |
| 6 | 221739 | |
| 9 | 209505 | |
| 4 | 196861 | |
| 2 | 192014 | |
| Other values (2) | 329380 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3167674 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 421300 | |
| - | 381965 | |
| 7 | 372583 | |
| 8 | 353308 | |
| 1 | 252479 | |
| 3 | 236540 | |
| 6 | 221739 | |
| 9 | 209505 | |
| 4 | 196861 | |
| 2 | 192014 | |
| Other values (2) | 329380 |
geodeticDatum
Text
Missing 
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 438700 |
| Missing (%) | 75.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 5 |
| Mean length | 5.585164363 |
| Min length | 3 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | WGS84 |
|---|---|
| 2nd row | NAD27 |
| 3rd row | WGS84 |
| 4th row | WGS84 |
| 5th row | WGS84 |
| Value | Count | Frequency (%) |
| wgs84 | 84632 | |
| nad27 | 33007 | 21.0% |
| nad83 | 8733 | 5.5% |
| prp_m | 8459 | 5.4% |
| not | 4217 | 2.7% |
| recorded | 4217 | 2.7% |
| agd66 | 2352 | 1.5% |
| japanese | 1809 | 1.1% |
| geodetic | 1809 | 1.1% |
| datum | 1809 | 1.1% |
| Other values (22) | 6483 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 94697 | |
| G | 90266 | |
| 4 | 86068 | |
| S | 85622 | |
| W | 85612 | |
| D | 46433 | 5.7% |
| A | 44645 | 5.5% |
| N | 42091 | 5.2% |
| 2 | 34845 | 4.3% |
| 7 | 33036 | 4.1% |
| Other values (36) | 169332 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 431894 | |
| Decimal Number | 268384 | |
| Lowercase Letter | 91871 | 11.3% |
| Space Separator | 12026 | 1.5% |
| Connector Punctuation | 8459 | 1.0% |
| Open Punctuation | 6 | < 0.1% |
| Close Punctuation | 6 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 18818 | |
| d | 11435 | |
| o | 10848 | |
| t | 9034 | |
| r | 8415 | |
| n | 6934 | 7.5% |
| a | 6788 | 7.4% |
| c | 6629 | 7.2% |
| u | 3002 | 3.3% |
| m | 2412 | 2.6% |
| Other values (8) | 7556 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 90266 | |
| S | 85622 | |
| W | 85612 | |
| D | 46433 | |
| A | 44645 | |
| N | 42091 | |
| P | 16928 | 3.9% |
| R | 8667 | 2.0% |
| M | 8459 | 2.0% |
| J | 1809 | 0.4% |
| Other values (3) | 1362 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 94697 | |
| 4 | 86068 | |
| 2 | 34845 | 13.0% |
| 7 | 33036 | 12.3% |
| 3 | 8786 | 3.3% |
| 0 | 5427 | 2.0% |
| 6 | 4705 | 1.8% |
| 9 | 488 | 0.2% |
| 1 | 170 | 0.1% |
| 5 | 162 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 12026 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8459 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 523765 | |
| Common | 288882 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 90266 | |
| S | 85622 | |
| W | 85612 | |
| D | 46433 | |
| A | 44645 | |
| N | 42091 | |
| e | 18818 | 3.6% |
| P | 16928 | 3.2% |
| d | 11435 | 2.2% |
| o | 10848 | 2.1% |
| Other values (21) | 71067 |
Common
| Value | Count | Frequency (%) |
| 8 | 94697 | |
| 4 | 86068 | |
| 2 | 34845 | 12.1% |
| 7 | 33036 | 11.4% |
| 12026 | 4.2% | |
| 3 | 8786 | 3.0% |
| _ | 8459 | 2.9% |
| 0 | 5427 | 1.9% |
| 6 | 4705 | 1.6% |
| 9 | 488 | 0.2% |
| Other values (5) | 345 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 812647 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 94697 | |
| G | 90266 | |
| 4 | 86068 | |
| S | 85622 | |
| W | 85612 | |
| D | 46433 | 5.7% |
| A | 44645 | 5.5% |
| N | 42091 | 5.2% |
| 2 | 34845 | 4.3% |
| 7 | 33036 | 4.1% |
| Other values (36) | 169332 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 7372 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 439218 |
| Missing (%) | 75.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 5.852920687 |
| Min length | 1 |
Unique
| Unique | 2163 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | 402.336 |
|---|---|
| 2nd row | 96.5606 |
| 3rd row | 152901 |
| 4th row | 6115 |
| 5th row | 1754.18 |
| Value | Count | Frequency (%) |
| 347.618 | 1384 | 1.0% |
| 186.684 | 1338 | 0.9% |
| 4615 | 1110 | 0.8% |
| 5615 | 1066 | 0.7% |
| 1066 | 1030 | 0.7% |
| 3615 | 978 | 0.7% |
| 5115 | 953 | 0.7% |
| 4115 | 946 | 0.7% |
| 177.028 | 882 | 0.6% |
| 402.336 | 826 | 0.6% |
| Other values (7362) | 134470 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 113723 | |
| . | 89782 | |
| 2 | 84938 | |
| 5 | 81478 | |
| 3 | 79563 | |
| 4 | 78325 | |
| 6 | 73067 | |
| 9 | 65740 | |
| 8 | 63393 | |
| 7 | 60182 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 758792 | |
| Other Punctuation | 89782 | 10.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 113723 | |
| 2 | 84938 | |
| 5 | 81478 | |
| 3 | 79563 | |
| 4 | 78325 | |
| 6 | 73067 | |
| 9 | 65740 | |
| 8 | 63393 | |
| 7 | 60182 | |
| 0 | 58383 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 89782 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 848574 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 113723 | |
| . | 89782 | |
| 2 | 84938 | |
| 5 | 81478 | |
| 3 | 79563 | |
| 4 | 78325 | |
| 6 | 73067 | |
| 9 | 65740 | |
| 8 | 63393 | |
| 7 | 60182 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 848574 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 113723 | |
| . | 89782 | |
| 2 | 84938 | |
| 5 | 81478 | |
| 3 | 79563 | |
| 4 | 78325 | |
| 6 | 73067 | |
| 9 | 65740 | |
| 8 | 63393 | |
| 7 | 60182 |
verbatimLatitude
Text
Missing 
| Distinct | 8743 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 334540 |
| Missing (%) | 57.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 10 |
| Mean length | 9.947965441 |
| Min length | 1 |
Unique
| Unique | 2180 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | 35 48 30 N |
|---|---|
| 2nd row | 15 38 -- N |
| 3rd row | 18 27 30 N |
| 4th row | 37 27 15 N |
| 5th row | 38 02 59 N |
| Value | Count | Frequency (%) |
| n | 226021 | |
| 35 | 53191 | 5.4% |
| 38 | 41911 | 4.3% |
| 39 | 37026 | 3.8% |
| 37 | 36639 | 3.8% |
| 31610 | 3.2% | |
| 36 | 31118 | 3.2% |
| s | 20880 | 2.1% |
| 40 | 16364 | 1.7% |
| 50 | 15334 | 1.6% |
| Other values (1884) | 466720 |
Most occurring characters
| Value | Count | Frequency (%) |
| 727153 | ||
| 3 | 317112 | |
| N | 226318 | 9.1% |
| 5 | 189972 | 7.6% |
| 0 | 159182 | 6.4% |
| 2 | 158470 | 6.4% |
| 4 | 154033 | 6.2% |
| 1 | 141432 | 5.7% |
| 8 | 85002 | 3.4% |
| 7 | 81967 | 3.3% |
| Other values (12) | 242978 | 9.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1433412 | |
| Space Separator | 727153 | |
| Uppercase Letter | 247343 | 10.0% |
| Dash Punctuation | 63417 | 2.6% |
| Other Punctuation | 12231 | 0.5% |
| Other Symbol | 38 | < 0.1% |
| Modifier Letter | 24 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 317112 | |
| 5 | 189972 | |
| 0 | 159182 | |
| 2 | 158470 | |
| 4 | 154033 | |
| 1 | 141432 | |
| 8 | 85002 | 5.9% |
| 7 | 81967 | 5.7% |
| 9 | 73456 | 5.1% |
| 6 | 72786 | 5.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11967 | |
| ' | 158 | 1.3% |
| " | 51 | 0.4% |
| ? | 38 | 0.3% |
| ; | 17 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 226318 | |
| S | 21025 | 8.5% |
Space Separator
| Value | Count | Frequency (%) |
| 727153 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 63417 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 38 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 24 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2236276 | |
| Latin | 247343 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 727153 | ||
| 3 | 317112 | |
| 5 | 189972 | 8.5% |
| 0 | 159182 | 7.1% |
| 2 | 158470 | 7.1% |
| 4 | 154033 | 6.9% |
| 1 | 141432 | 6.3% |
| 8 | 85002 | 3.8% |
| 7 | 81967 | 3.7% |
| 9 | 73456 | 3.3% |
| Other values (10) | 148497 | 6.6% |
Latin
| Value | Count | Frequency (%) |
| N | 226318 | |
| S | 21025 | 8.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2483556 | |
| None | 38 | < 0.1% |
| Modifier Letters | 24 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 727153 | ||
| 3 | 317112 | |
| N | 226318 | 9.1% |
| 5 | 189972 | 7.6% |
| 0 | 159182 | 6.4% |
| 2 | 158470 | 6.4% |
| 4 | 154033 | 6.2% |
| 1 | 141432 | 5.7% |
| 8 | 85002 | 3.4% |
| 7 | 81967 | 3.3% |
| Other values (9) | 242915 | 9.8% |
None
| Value | Count | Frequency (%) |
| ° | 38 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 24 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Missing 
| Distinct | 9294 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 334562 |
| Missing (%) | 57.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 11 |
| Mean length | 10.89844536 |
| Min length | 3 |
Unique
| Unique | 2355 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | 082 20 53 W |
|---|---|
| 2nd row | 088 15 -- W |
| 3rd row | 063 33 13 W |
| 4th row | 077 05 15 W |
| 5th row | 77 41 22 W |
| Value | Count | Frequency (%) |
| w | 225797 | |
| 083 | 32649 | 3.3% |
| 32039 | 3.3% | |
| e | 21036 | 2.2% |
| 077 | 20721 | 2.1% |
| 081 | 18686 | 1.9% |
| 080 | 18538 | 1.9% |
| 076 | 17637 | 1.8% |
| 078 | 17121 | 1.8% |
| 079 | 16537 | 1.7% |
| Other values (2116) | 555848 |
Most occurring characters
| Value | Count | Frequency (%) |
| 726970 | ||
| 0 | 378660 | |
| W | 225899 | 8.3% |
| 8 | 184727 | 6.8% |
| 3 | 182195 | 6.7% |
| 7 | 168795 | 6.2% |
| 1 | 163256 | 6.0% |
| 2 | 149535 | 5.5% |
| 4 | 149396 | 5.5% |
| 5 | 148151 | 5.4% |
| Other values (15) | 243093 | 8.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1668307 | |
| Space Separator | 726970 | |
| Uppercase Letter | 247315 | 9.1% |
| Dash Punctuation | 65666 | 2.4% |
| Other Punctuation | 12208 | 0.4% |
| Open Punctuation | 73 | < 0.1% |
| Close Punctuation | 73 | < 0.1% |
| Other Symbol | 39 | < 0.1% |
| Modifier Letter | 24 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 378660 | |
| 8 | 184727 | |
| 3 | 182195 | |
| 7 | 168795 | |
| 1 | 163256 | |
| 2 | 149535 | 9.0% |
| 4 | 149396 | 9.0% |
| 5 | 148151 | 8.9% |
| 9 | 72593 | 4.4% |
| 6 | 70999 | 4.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11959 | |
| ' | 144 | 1.2% |
| " | 50 | 0.4% |
| ? | 38 | 0.3% |
| ; | 17 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 225899 | |
| E | 21415 | 8.7% |
| S | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 726970 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 65666 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 73 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 73 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 39 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 24 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2473362 | |
| Latin | 247315 | 9.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 726970 | ||
| 0 | 378660 | |
| 8 | 184727 | 7.5% |
| 3 | 182195 | 7.4% |
| 7 | 168795 | 6.8% |
| 1 | 163256 | 6.6% |
| 2 | 149535 | 6.0% |
| 4 | 149396 | 6.0% |
| 5 | 148151 | 6.0% |
| 9 | 72593 | 2.9% |
| Other values (12) | 149084 | 6.0% |
Latin
| Value | Count | Frequency (%) |
| W | 225899 | |
| E | 21415 | 8.7% |
| S | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2720612 | |
| None | 39 | < 0.1% |
| Modifier Letters | 24 | < 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 726970 | ||
| 0 | 378660 | |
| W | 225899 | 8.3% |
| 8 | 184727 | 6.8% |
| 3 | 182195 | 6.7% |
| 7 | 168795 | 6.2% |
| 1 | 163256 | 6.0% |
| 2 | 149535 | 5.5% |
| 4 | 149396 | 5.5% |
| 5 | 148151 | 5.4% |
| Other values (12) | 243028 | 8.9% |
None
| Value | Count | Frequency (%) |
| ° | 39 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 24 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 2 |
Missing 
| Distinct | 3371 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 439136 |
| Missing (%) | 75.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 302 |
|---|---|
| Median length | 251 |
| Mean length | 91.26128977 |
| Min length | 3 |
Unique
| Unique | 891 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | USGS Palo Alto Quad (TopoZone - 1:24,000), MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
|---|---|
| 2nd row | Terrain Navigator v. 5.03 USGS 1:24,000, MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| 3rd row | Alexandria Digital Library Gazetteer, MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| 4th row | USGS Chesterfield Quad (TopoZine - 1:24,000), MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| 5th row | USGS Falls Church Quad (TopoZone - 1:24,000), MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| Value | Count | Frequency (%) |
| georeferencing | 134216 | 9.7% |
| manis/herpnet/ornis | 134163 | 9.7% |
| guidelines | 134143 | 9.7% |
| usgs | 59079 | 4.3% |
| 1:24,000 | 54333 | 3.9% |
| 44136 | 3.2% | |
| quad | 39827 | 2.9% |
| digital | 22588 | 1.6% |
| gazetteer | 22105 | 1.6% |
| topozone | 21638 | 1.6% |
| Other values (3792) | 715459 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1320173 | 10.0% |
| 1236622 | 9.3% | |
| r | 733799 | 5.5% |
| i | 691510 | 5.2% |
| a | 629206 | 4.8% |
| n | 622138 | 4.7% |
| o | 500801 | 3.8% |
| N | 461182 | 3.5% |
| S | 454207 | 3.4% |
| G | 414644 | 3.1% |
| Other values (76) | 6174537 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7136568 | |
| Uppercase Letter | 3060694 | |
| Space Separator | 1236622 | 9.3% |
| Decimal Number | 835786 | 6.3% |
| Other Punctuation | 760937 | 5.7% |
| Open Punctuation | 71491 | 0.5% |
| Close Punctuation | 71272 | 0.5% |
| Dash Punctuation | 65161 | 0.5% |
| Connector Punctuation | 248 | < 0.1% |
| Math Symbol | 40 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1320173 | |
| r | 733799 | |
| i | 691510 | |
| a | 629206 | |
| n | 622138 | |
| o | 500801 | 7.0% |
| l | 307980 | 4.3% |
| d | 294924 | 4.1% |
| t | 258449 | 3.6% |
| g | 250955 | 3.5% |
| Other values (19) | 1526633 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 461182 | |
| S | 454207 | |
| G | 414644 | |
| I | 303621 | |
| T | 221610 | |
| M | 189899 | |
| E | 166244 | 5.4% |
| O | 161796 | 5.3% |
| R | 151192 | 4.9% |
| H | 140237 | 4.6% |
| Other values (17) | 396062 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 286534 | |
| , | 258402 | |
| : | 100996 | 13.3% |
| . | 80708 | 10.6% |
| ; | 15057 | 2.0% |
| ! | 9034 | 1.2% |
| # | 6647 | 0.9% |
| ' | 2637 | 0.3% |
| & | 813 | 0.1% |
| ? | 94 | < 0.1% |
| Other values (3) | 15 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 379892 | |
| 1 | 133690 | 16.0% |
| 2 | 100269 | 12.0% |
| 4 | 76915 | 9.2% |
| 5 | 38693 | 4.6% |
| 7 | 25544 | 3.1% |
| 9 | 22590 | 2.7% |
| 6 | 22338 | 2.7% |
| 3 | 22202 | 2.7% |
| 8 | 13653 | 1.6% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 24 | |
| = | 16 |
Space Separator
| Value | Count | Frequency (%) |
| 1236622 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 71491 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 71272 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 65161 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 248 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10197262 | |
| Common | 3041557 | 23.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1320173 | 12.9% |
| r | 733799 | 7.2% |
| i | 691510 | 6.8% |
| a | 629206 | 6.2% |
| n | 622138 | 6.1% |
| o | 500801 | 4.9% |
| N | 461182 | 4.5% |
| S | 454207 | 4.5% |
| G | 414644 | 4.1% |
| l | 307980 | 3.0% |
| Other values (46) | 4061622 |
Common
| Value | Count | Frequency (%) |
| 1236622 | ||
| 0 | 379892 | 12.5% |
| / | 286534 | 9.4% |
| , | 258402 | 8.5% |
| 1 | 133690 | 4.4% |
| : | 100996 | 3.3% |
| 2 | 100269 | 3.3% |
| . | 80708 | 2.7% |
| 4 | 76915 | 2.5% |
| ( | 71491 | 2.4% |
| Other values (20) | 316038 | 10.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13234776 | |
| None | 4039 | < 0.1% |
| Punctuation | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1320173 | 10.0% |
| 1236622 | 9.3% | |
| r | 733799 | 5.5% |
| i | 691510 | 5.2% |
| a | 629206 | 4.8% |
| n | 622138 | 4.7% |
| o | 500801 | 3.8% |
| N | 461182 | 3.5% |
| S | 454207 | 3.4% |
| G | 414644 | 3.1% |
| Other values (71) | 6170494 |
None
| Value | Count | Frequency (%) |
| í | 4030 | |
| é | 5 | 0.1% |
| ô | 2 | < 0.1% |
| Î | 2 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| ‡ | 4 |
Missing 
| Distinct | 3681 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 443625 |
| Missing (%) | 75.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 83 |
|---|---|
| Median length | 55 |
| Mean length | 22.53162702 |
| Min length | 7 |
Unique
| Unique | 1057 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Locality extent = 0.05 |
|---|---|
| 2nd row | Locality extent = 95 |
| 3rd row | Locality extent = 3.5 |
| 4th row | Datum Guam 63 |
| 5th row | Locality extent = 1.08 |
| Value | Count | Frequency (%) |
| extent | 134257 | |
| 134207 | ||
| locality | 134203 | |
| mi | 40072 | 6.6% |
| km | 8736 | 1.4% |
| 0.1 | 7251 | 1.2% |
| datum | 6200 | 1.0% |
| 63 | 5497 | 0.9% |
| guam | 5494 | 0.9% |
| 1 | 5323 | 0.9% |
| Other values (2938) | 128798 |
Most occurring characters
| Value | Count | Frequency (%) |
| 469462 | ||
| t | 411232 | |
| e | 269464 | 8.5% |
| i | 175099 | 5.5% |
| . | 149589 | 4.7% |
| a | 146541 | 4.6% |
| l | 134689 | 4.3% |
| n | 134567 | 4.2% |
| o | 134447 | 4.2% |
| y | 134376 | 4.2% |
| Other values (54) | 1007940 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1896549 | |
| Space Separator | 469462 | 14.8% |
| Decimal Number | 368654 | 11.6% |
| Other Punctuation | 149871 | 4.7% |
| Uppercase Letter | 148496 | 4.7% |
| Math Symbol | 134208 | 4.2% |
| Dash Punctuation | 72 | < 0.1% |
| Open Punctuation | 47 | < 0.1% |
| Close Punctuation | 47 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 411232 | |
| e | 269464 | |
| i | 175099 | |
| a | 146541 | 7.7% |
| l | 134689 | 7.1% |
| n | 134567 | 7.1% |
| o | 134447 | 7.1% |
| y | 134376 | 7.1% |
| x | 134300 | 7.1% |
| c | 134263 | 7.1% |
| Other values (14) | 87571 | 4.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 134266 | |
| G | 6166 | 4.2% |
| D | 6026 | 4.1% |
| S | 774 | 0.5% |
| W | 687 | 0.5% |
| H | 144 | 0.1% |
| N | 119 | 0.1% |
| P | 107 | 0.1% |
| E | 71 | < 0.1% |
| A | 37 | < 0.1% |
| Other values (9) | 99 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 74829 | |
| 1 | 61579 | |
| 5 | 51221 | |
| 2 | 46439 | |
| 3 | 35147 | |
| 6 | 23996 | 6.5% |
| 4 | 21925 | 5.9% |
| 7 | 21708 | 5.9% |
| 8 | 19177 | 5.2% |
| 9 | 12633 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 149589 | |
| ; | 174 | 0.1% |
| , | 71 | < 0.1% |
| : | 19 | < 0.1% |
| / | 12 | < 0.1% |
| ' | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 469462 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 134208 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 72 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 47 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 47 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2045045 | |
| Common | 1122361 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 411232 | |
| e | 269464 | |
| i | 175099 | |
| a | 146541 | 7.2% |
| l | 134689 | 6.6% |
| n | 134567 | 6.6% |
| o | 134447 | 6.6% |
| y | 134376 | 6.6% |
| x | 134300 | 6.6% |
| L | 134266 | 6.6% |
| Other values (33) | 236064 |
Common
| Value | Count | Frequency (%) |
| 469462 | ||
| . | 149589 | 13.3% |
| = | 134208 | 12.0% |
| 0 | 74829 | 6.7% |
| 1 | 61579 | 5.5% |
| 5 | 51221 | 4.6% |
| 2 | 46439 | 4.1% |
| 3 | 35147 | 3.1% |
| 6 | 23996 | 2.1% |
| 4 | 21925 | 2.0% |
| Other values (11) | 53966 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3167406 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 469462 | ||
| t | 411232 | |
| e | 269464 | 8.5% |
| i | 175099 | 5.5% |
| . | 149589 | 4.7% |
| a | 146541 | 4.6% |
| l | 134689 | 4.3% |
| n | 134567 | 4.2% |
| o | 134447 | 4.2% |
| y | 134376 | 4.2% |
| Other values (54) | 1007940 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 583784 |
| Missing (%) | 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 3 |
| Mean length | 3.167865707 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | aff. |
|---|---|
| 2nd row | cf. |
| 3rd row | cf. |
| 4th row | cf. |
| 5th row | cf. |
| Value | Count | Frequency (%) |
| cf | 382 | |
| aff | 28 | 6.7% |
| uncertain | 7 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 438 | |
| . | 410 | |
| c | 389 | |
| a | 35 | 2.6% |
| n | 14 | 1.1% |
| u | 7 | 0.5% |
| e | 7 | 0.5% |
| r | 7 | 0.5% |
| t | 7 | 0.5% |
| i | 7 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 911 | |
| Other Punctuation | 410 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 438 | |
| c | 389 | |
| a | 35 | 3.8% |
| n | 14 | 1.5% |
| u | 7 | 0.8% |
| e | 7 | 0.8% |
| r | 7 | 0.8% |
| t | 7 | 0.8% |
| i | 7 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 410 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 911 | |
| Common | 410 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 438 | |
| c | 389 | |
| a | 35 | 3.8% |
| n | 14 | 1.5% |
| u | 7 | 0.8% |
| e | 7 | 0.8% |
| r | 7 | 0.8% |
| t | 7 | 0.8% |
| i | 7 | 0.8% |
Common
| Value | Count | Frequency (%) |
| . | 410 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1321 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 438 | |
| . | 410 | |
| c | 389 | |
| a | 35 | 2.6% |
| n | 14 | 1.1% |
| u | 7 | 0.5% |
| e | 7 | 0.5% |
| r | 7 | 0.5% |
| t | 7 | 0.5% |
| i | 7 | 0.5% |
typeStatus
Text
Missing 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 570681 |
| Missing (%) | 97.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 8 |
| Mean length | 8.390310651 |
| Min length | 7 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Paratype |
|---|---|
| 2nd row | Paratype |
| 3rd row | Paratype |
| 4th row | Paratype |
| 5th row | Paralectotype |
| Value | Count | Frequency (%) |
| paratype | 10833 | |
| holotype | 1225 | 8.8% |
| syntype | 1222 | 8.8% |
| paralectotype | 502 | 3.6% |
| lectotype | 104 | 0.7% |
| neotype | 25 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 22670 | |
| y | 15133 | |
| e | 14542 | |
| t | 14517 | |
| p | 13911 | |
| P | 11335 | |
| r | 11335 | |
| o | 3081 | 2.7% |
| l | 1727 | 1.5% |
| H | 1225 | 1.1% |
| Other values (7) | 3961 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 98744 | |
| Uppercase Letter | 13911 | 12.3% |
| Other Punctuation | 391 | 0.3% |
| Space Separator | 391 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 22670 | |
| y | 15133 | |
| e | 14542 | |
| t | 14517 | |
| p | 13911 | |
| r | 11335 | |
| o | 3081 | 3.1% |
| l | 1727 | 1.7% |
| n | 1222 | 1.2% |
| c | 606 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 11335 | |
| H | 1225 | 8.8% |
| S | 1222 | 8.8% |
| L | 104 | 0.7% |
| N | 25 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 391 |
Space Separator
| Value | Count | Frequency (%) |
| 391 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 112655 | |
| Common | 782 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 22670 | |
| y | 15133 | |
| e | 14542 | |
| t | 14517 | |
| p | 13911 | |
| P | 11335 | |
| r | 11335 | |
| o | 3081 | 2.7% |
| l | 1727 | 1.5% |
| H | 1225 | 1.1% |
| Other values (5) | 3179 | 2.8% |
Common
| Value | Count | Frequency (%) |
| ; | 391 | |
| 391 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 113437 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 22670 | |
| y | 15133 | |
| e | 14542 | |
| t | 14517 | |
| p | 13911 | |
| P | 11335 | |
| r | 11335 | |
| o | 3081 | 2.7% |
| l | 1727 | 1.5% |
| H | 1225 | 1.1% |
| Other values (7) | 3961 | 3.5% |
identifiedBy
Text
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 584125 |
| Missing (%) | > 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 122 |
|---|---|
| Median length | 18 |
| Mean length | 25.17105263 |
| Min length | 14 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | Gower, David, (BMNH), The Natural History Museum (UNITED KINGDOM) |
|---|---|
| 2nd row | Crombie, Ronald I. |
| 3rd row | Crombie, Ronald I. |
| 4th row | Crombie, Ronald I. |
| 5th row | Crombie, Ronald I. |
| Value | Count | Frequency (%) |
| ronald | 56 | |
| crombie | 55 | |
| i | 55 | |
| natural | 11 | 3.7% |
| history | 11 | 3.7% |
| museum | 11 | 3.7% |
| united | 11 | 3.7% |
| gower | 10 | 3.3% |
| david | 10 | 3.3% |
| bmnh | 10 | 3.3% |
| Other values (26) | 60 |
Most occurring characters
| Value | Count | Frequency (%) |
| 224 | 11.7% | |
| o | 146 | 7.6% |
| e | 102 | 5.3% |
| r | 99 | 5.2% |
| , | 98 | 5.1% |
| a | 95 | 5.0% |
| i | 87 | 4.5% |
| I | 77 | 4.0% |
| n | 73 | 3.8% |
| d | 73 | 3.8% |
| Other values (39) | 839 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1027 | |
| Uppercase Letter | 452 | |
| Space Separator | 224 | 11.7% |
| Other Punctuation | 163 | 8.5% |
| Close Punctuation | 22 | 1.2% |
| Open Punctuation | 22 | 1.2% |
| Dash Punctuation | 3 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 77 | |
| R | 61 | |
| C | 58 | |
| N | 43 | |
| M | 31 | |
| D | 31 | |
| H | 27 | 6.0% |
| G | 24 | 5.3% |
| T | 23 | 5.1% |
| E | 14 | 3.1% |
| Other values (12) | 63 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 146 | |
| e | 102 | |
| r | 99 | |
| a | 95 | |
| i | 87 | |
| n | 73 | |
| d | 73 | |
| l | 69 | |
| m | 68 | |
| b | 56 | 5.5% |
| Other values (11) | 159 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 98 | |
| . | 65 |
Space Separator
| Value | Count | Frequency (%) |
| 224 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 22 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 22 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1479 | |
| Common | 434 | 22.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 146 | 9.9% |
| e | 102 | 6.9% |
| r | 99 | 6.7% |
| a | 95 | 6.4% |
| i | 87 | 5.9% |
| I | 77 | 5.2% |
| n | 73 | 4.9% |
| d | 73 | 4.9% |
| l | 69 | 4.7% |
| m | 68 | 4.6% |
| Other values (33) | 590 |
Common
| Value | Count | Frequency (%) |
| 224 | ||
| , | 98 | |
| . | 65 | 15.0% |
| ) | 22 | 5.1% |
| ( | 22 | 5.1% |
| - | 3 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1913 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 224 | 11.7% | |
| o | 146 | 7.6% |
| e | 102 | 5.3% |
| r | 99 | 5.2% |
| , | 98 | 5.1% |
| a | 95 | 5.0% |
| i | 87 | 4.5% |
| I | 77 | 4.0% |
| n | 73 | 3.8% |
| d | 73 | 3.8% |
| Other values (39) | 839 |
scientificName
Text
| Distinct | 9530 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 56 |
| Mean length | 19.84556343 |
| Min length | 4 |
Unique
| Unique | 1890 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Carlia bicarinata |
|---|---|
| 2nd row | Plethodon montanus |
| 3rd row | Enhydris enhydris |
| 4th row | Gehyra mutilata |
| 5th row | Anolis richardii |
| Value | Count | Frequency (%) |
| plethodon | 168423 | 14.0% |
| cinereus | 75774 | 6.3% |
| desmognathus | 35846 | 3.0% |
| anolis | 18352 | 1.5% |
| glutinosus | 13372 | 1.1% |
| lithobates | 12991 | 1.1% |
| fuscus | 11321 | 0.9% |
| montanus | 10417 | 0.9% |
| eleutherodactylus | 9959 | 0.8% |
| anaxyrus | 9474 | 0.8% |
| Other values (7195) | 837184 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 976778 | 8.4% |
| o | 954528 | 8.2% |
| s | 947788 | 8.2% |
| a | 896948 | 7.7% |
| i | 821046 | 7.1% |
| n | 729826 | 6.3% |
| t | 711935 | 6.1% |
| l | 642687 | 5.5% |
| u | 635392 | 5.5% |
| r | 633897 | 5.5% |
| Other values (49) | 3642973 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10381385 | |
| Space Separator | 618912 | 5.3% |
| Uppercase Letter | 582830 | 5.0% |
| Other Punctuation | 10078 | 0.1% |
| Dash Punctuation | 561 | < 0.1% |
| Open Punctuation | 16 | < 0.1% |
| Close Punctuation | 16 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 976778 | |
| o | 954528 | 9.2% |
| s | 947788 | 9.1% |
| a | 896948 | 8.6% |
| i | 821046 | 7.9% |
| n | 729826 | 7.0% |
| t | 711935 | 6.9% |
| l | 642687 | 6.2% |
| u | 635392 | 6.1% |
| r | 633897 | 6.1% |
| Other values (16) | 2430560 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 210261 | |
| A | 59585 | 10.2% |
| D | 48691 | 8.4% |
| L | 39048 | 6.7% |
| E | 33682 | 5.8% |
| S | 33117 | 5.7% |
| C | 32240 | 5.5% |
| H | 26213 | 4.5% |
| T | 17139 | 2.9% |
| R | 13689 | 2.3% |
| Other values (15) | 69165 | 11.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 8496 | |
| . | 1545 | 15.3% |
| / | 21 | 0.2% |
| ? | 16 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 618912 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 561 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 16 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10964215 | |
| Common | 629583 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 976778 | 8.9% |
| o | 954528 | 8.7% |
| s | 947788 | 8.6% |
| a | 896948 | 8.2% |
| i | 821046 | 7.5% |
| n | 729826 | 6.7% |
| t | 711935 | 6.5% |
| l | 642687 | 5.9% |
| u | 635392 | 5.8% |
| r | 633897 | 5.8% |
| Other values (41) | 3013390 |
Common
| Value | Count | Frequency (%) |
| 618912 | ||
| " | 8496 | 1.3% |
| . | 1545 | 0.2% |
| - | 561 | 0.1% |
| / | 21 | < 0.1% |
| ( | 16 | < 0.1% |
| ? | 16 | < 0.1% |
| ) | 16 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11593798 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 976778 | 8.4% |
| o | 954528 | 8.2% |
| s | 947788 | 8.2% |
| a | 896948 | 7.7% |
| i | 821046 | 7.1% |
| n | 729826 | 6.3% |
| t | 711935 | 6.1% |
| l | 642687 | 5.5% |
| u | 635392 | 5.5% |
| r | 633897 | 5.5% |
| Other values (49) | 3642973 |
| Distinct | 167 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 86 |
|---|---|
| Median length | 82 |
| Mean length | 66.44007265 |
| Min length | 10 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Reptilia, Squamata, Sauria, Scincidae, Eugongylinae |
|---|---|
| 2nd row | Animalia, Chordata, Vertebrata, Amphibia, Caudata, Plethodontidae |
| 3rd row | Animalia, Chordata, Vertebrata, Reptilia, Squamata, Ophidia, Homalopsinae |
| 4th row | Animalia, Chordata, Vertebrata, Reptilia, Squamata, Sauria, Gekkoninae |
| 5th row | Animalia, Chordata, Vertebrata, Reptilia, Squamata, Sauria, Polychrotinae |
| Value | Count | Frequency (%) |
| animalia | 584195 | |
| vertebrata | 584195 | |
| chordata | 584178 | |
| amphibia | 395159 | |
| caudata | 237127 | |
| plethodontidae | 221369 | 5.9% |
| reptilia | 189036 | 5.1% |
| squamata | 169309 | 4.5% |
| anura | 157511 | 4.2% |
| sauria | 116154 | 3.1% |
| Other values (166) | 484544 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6566805 | |
| i | 3313617 | 8.5% |
| , | 3138578 | 8.1% |
| 3138578 | 8.1% | |
| t | 3000106 | 7.7% |
| e | 2360956 | 6.1% |
| r | 2244920 | 5.8% |
| d | 1648115 | 4.2% |
| h | 1357195 | 3.5% |
| n | 1355739 | 3.5% |
| Other values (36) | 10689615 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28814291 | |
| Uppercase Letter | 3722777 | 9.6% |
| Other Punctuation | 3138578 | 8.1% |
| Space Separator | 3138578 | 8.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6566805 | |
| i | 3313617 | |
| t | 3000106 | |
| e | 2360956 | 8.2% |
| r | 2244920 | 7.8% |
| d | 1648115 | 5.7% |
| h | 1357195 | 4.7% |
| n | 1355739 | 4.7% |
| o | 1350378 | 4.7% |
| m | 1224848 | 4.3% |
| Other values (14) | 4391612 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1151930 | |
| C | 876519 | |
| V | 590792 | |
| S | 343033 | 9.2% |
| P | 265039 | 7.1% |
| R | 211930 | 5.7% |
| O | 52750 | 1.4% |
| H | 46430 | 1.2% |
| E | 33840 | 0.9% |
| T | 33424 | 0.9% |
| Other values (10) | 117090 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3138578 |
Space Separator
| Value | Count | Frequency (%) |
| 3138578 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32537068 | |
| Common | 6277156 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6566805 | |
| i | 3313617 | |
| t | 3000106 | 9.2% |
| e | 2360956 | 7.3% |
| r | 2244920 | 6.9% |
| d | 1648115 | 5.1% |
| h | 1357195 | 4.2% |
| n | 1355739 | 4.2% |
| o | 1350378 | 4.2% |
| m | 1224848 | 3.8% |
| Other values (34) | 8114389 |
Common
| Value | Count | Frequency (%) |
| , | 3138578 | |
| 3138578 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38814224 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6566805 | |
| i | 3313617 | 8.5% |
| , | 3138578 | 8.1% |
| 3138578 | 8.1% | |
| t | 3000106 | 7.7% |
| e | 2360956 | 6.1% |
| r | 2244920 | 5.8% |
| d | 1648115 | 4.2% |
| h | 1357195 | 3.5% |
| n | 1355739 | 3.5% |
| Other values (36) | 10689615 |
kingdom
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 584195 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1168390 | |
| a | 1168390 | |
| A | 584195 | |
| n | 584195 | |
| m | 584195 | |
| l | 584195 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4089365 | |
| Uppercase Letter | 584195 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1168390 | |
| a | 1168390 | |
| n | 584195 | |
| m | 584195 | |
| l | 584195 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 584195 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4673560 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1168390 | |
| a | 1168390 | |
| A | 584195 | |
| n | 584195 | |
| m | 584195 | |
| l | 584195 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4673560 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1168390 | |
| a | 1168390 | |
| A | 584195 | |
| n | 584195 | |
| m | 584195 | |
| l | 584195 |
phylum
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 23 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Chordata |
| 3rd row | Chordata |
| 4th row | Chordata |
| 5th row | Chordata |
| Value | Count | Frequency (%) |
| chordata | 584178 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1168356 | |
| C | 584178 | |
| h | 584178 | |
| o | 584178 | |
| r | 584178 | |
| d | 584178 | |
| t | 584178 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4089246 | |
| Uppercase Letter | 584178 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1168356 | |
| h | 584178 | |
| o | 584178 | |
| r | 584178 | |
| d | 584178 | |
| t | 584178 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 584178 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4673424 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1168356 | |
| C | 584178 | |
| h | 584178 | |
| o | 584178 | |
| r | 584178 | |
| d | 584178 | |
| t | 584178 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4673424 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1168356 | |
| C | 584178 | |
| h | 584178 | |
| o | 584178 | |
| r | 584178 | |
| d | 584178 | |
| t | 584178 |
class
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Reptilia |
|---|---|
| 2nd row | Amphibia |
| 3rd row | Reptilia |
| 4th row | Reptilia |
| 5th row | Reptilia |
| Value | Count | Frequency (%) |
| amphibia | 395159 | |
| reptilia | 189036 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1168390 | |
| p | 584195 | |
| a | 584195 | |
| A | 395159 | 8.5% |
| m | 395159 | 8.5% |
| h | 395159 | 8.5% |
| b | 395159 | 8.5% |
| R | 189036 | 4.0% |
| e | 189036 | 4.0% |
| t | 189036 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4089365 | |
| Uppercase Letter | 584195 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1168390 | |
| p | 584195 | |
| a | 584195 | |
| m | 395159 | 9.7% |
| h | 395159 | 9.7% |
| b | 395159 | 9.7% |
| e | 189036 | 4.6% |
| t | 189036 | 4.6% |
| l | 189036 | 4.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 395159 | |
| R | 189036 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4673560 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1168390 | |
| p | 584195 | |
| a | 584195 | |
| A | 395159 | 8.5% |
| m | 395159 | 8.5% |
| h | 395159 | 8.5% |
| b | 395159 | 8.5% |
| R | 189036 | 4.0% |
| e | 189036 | 4.0% |
| t | 189036 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4673560 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1168390 | |
| p | 584195 | |
| a | 584195 | |
| A | 395159 | 8.5% |
| m | 395159 | 8.5% |
| h | 395159 | 8.5% |
| b | 395159 | 8.5% |
| R | 189036 | 4.0% |
| e | 189036 | 4.0% |
| t | 189036 | 4.0% |
order
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 11 |
| Mean length | 6.855565351 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Squamata |
|---|---|
| 2nd row | Caudata |
| 3rd row | Squamata |
| 4th row | Squamata |
| 5th row | Squamata |
| Value | Count | Frequency (%) |
| caudata | 237127 | |
| squamata | 169309 | |
| anura | 157511 | |
| testudines | 18909 | 3.2% |
| crocodilia | 804 | 0.1% |
| gymnophiona | 521 | 0.1% |
| rhynchocephalia | 14 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1378172 | |
| u | 582856 | |
| t | 425345 | 10.6% |
| d | 256840 | 6.4% |
| C | 237931 | 5.9% |
| n | 177476 | 4.4% |
| m | 169830 | 4.2% |
| S | 169309 | 4.2% |
| q | 169309 | 4.2% |
| r | 158315 | 4.0% |
| Other values (13) | 279604 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3420792 | |
| Uppercase Letter | 584195 | 14.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1378172 | |
| u | 582856 | |
| t | 425345 | 12.4% |
| d | 256840 | 7.5% |
| n | 177476 | 5.2% |
| m | 169830 | 5.0% |
| q | 169309 | 4.9% |
| r | 158315 | 4.6% |
| e | 37832 | 1.1% |
| s | 37818 | 1.1% |
| Other values (7) | 26999 | 0.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 237931 | |
| S | 169309 | |
| A | 157511 | |
| T | 18909 | 3.2% |
| G | 521 | 0.1% |
| R | 14 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4004987 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1378172 | |
| u | 582856 | |
| t | 425345 | 10.6% |
| d | 256840 | 6.4% |
| C | 237931 | 5.9% |
| n | 177476 | 4.4% |
| m | 169830 | 4.2% |
| S | 169309 | 4.2% |
| q | 169309 | 4.2% |
| r | 158315 | 4.0% |
| Other values (13) | 279604 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4004987 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1378172 | |
| u | 582856 | |
| t | 425345 | 10.6% |
| d | 256840 | 6.4% |
| C | 237931 | 5.9% |
| n | 177476 | 4.4% |
| m | 169830 | 4.2% |
| S | 169309 | 4.2% |
| q | 169309 | 4.2% |
| r | 158315 | 4.0% |
| Other values (13) | 279604 | 7.0% |
family
Text
| Distinct | 146 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 183 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 19 |
| Mean length | 12.11108562 |
| Min length | 6 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Scincidae |
|---|---|
| 2nd row | Plethodontidae |
| 3rd row | Homalopsinae |
| 4th row | Gekkoninae |
| 5th row | Polychrotinae |
| Value | Count | Frequency (%) |
| plethodontidae | 221369 | |
| hylinae | 41496 | 7.1% |
| scincidae | 26137 | 4.5% |
| bufonidae | 25125 | 4.3% |
| ranidae | 20319 | 3.5% |
| polychrotinae | 18552 | 3.2% |
| gekkoninae | 17268 | 3.0% |
| phrynosomatinae | 16259 | 2.8% |
| colubrinae | 15640 | 2.7% |
| natricinae | 12705 | 2.2% |
| Other values (136) | 169148 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 930659 | |
| a | 759819 | |
| d | 735567 | |
| o | 715098 | |
| i | 680312 | |
| t | 609836 | |
| n | 542175 | |
| l | 383862 | 5.4% |
| h | 316462 | 4.5% |
| P | 264677 | 3.7% |
| Other values (32) | 1134625 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6489074 | |
| Uppercase Letter | 584018 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 930659 | |
| a | 759819 | |
| d | 735567 | |
| o | 715098 | |
| i | 680312 | |
| t | 609836 | |
| n | 542175 | |
| l | 383862 | |
| h | 316462 | 4.9% |
| r | 170638 | 2.6% |
| Other values (12) | 644646 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 264677 | |
| S | 49954 | 8.6% |
| H | 46430 | 8.0% |
| C | 31134 | 5.3% |
| B | 27124 | 4.6% |
| R | 22836 | 3.9% |
| G | 21056 | 3.6% |
| E | 19510 | 3.3% |
| L | 16895 | 2.9% |
| D | 14735 | 2.5% |
| Other values (10) | 69667 | 11.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7073092 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 930659 | |
| a | 759819 | |
| d | 735567 | |
| o | 715098 | |
| i | 680312 | |
| t | 609836 | |
| n | 542175 | |
| l | 383862 | 5.4% |
| h | 316462 | 4.5% |
| P | 264677 | 3.7% |
| Other values (32) | 1134625 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7073092 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 930659 | |
| a | 759819 | |
| d | 735567 | |
| o | 715098 | |
| i | 680312 | |
| t | 609836 | |
| n | 542175 | |
| l | 383862 | 5.4% |
| h | 316462 | 4.5% |
| P | 264677 | 3.7% |
| Other values (32) | 1134625 |
genus
Text
| Distinct | 1387 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 9.509797175 |
| Min length | 3 |
Unique
| Unique | 139 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Carlia |
|---|---|
| 2nd row | Plethodon |
| 3rd row | Enhydris |
| 4th row | Gehyra |
| 5th row | Anolis |
| Value | Count | Frequency (%) |
| plethodon | 168423 | |
| desmognathus | 35844 | 6.1% |
| anolis | 18333 | 3.1% |
| lithobates | 12993 | 2.2% |
| eleutherodactylus | 9947 | 1.7% |
| anaxyrus | 9476 | 1.6% |
| sceloporus | 8824 | 1.5% |
| emoia | 8211 | 1.4% |
| eurycea | 7626 | 1.3% |
| pseudacris | 6800 | 1.2% |
| Other values (1376) | 297722 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 674870 | |
| e | 454080 | 8.2% |
| t | 412656 | 7.4% |
| s | 399702 | 7.2% |
| l | 372025 | 6.7% |
| a | 366974 | 6.6% |
| h | 357871 | 6.4% |
| n | 346801 | 6.2% |
| d | 269766 | 4.9% |
| i | 237466 | 4.3% |
| Other values (41) | 1663403 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4972908 | |
| Uppercase Letter | 582706 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 674870 | |
| e | 454080 | |
| t | 412656 | 8.3% |
| s | 399702 | 8.0% |
| l | 372025 | 7.5% |
| a | 366974 | 7.4% |
| h | 357871 | 7.2% |
| n | 346801 | 7.0% |
| d | 269766 | 5.4% |
| i | 237466 | 4.8% |
| Other values (16) | 1080697 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 210245 | |
| A | 59587 | 10.2% |
| D | 48689 | 8.4% |
| L | 39050 | 6.7% |
| E | 33523 | 5.8% |
| S | 33115 | 5.7% |
| C | 32240 | 5.5% |
| H | 26213 | 4.5% |
| T | 17139 | 2.9% |
| R | 13671 | 2.3% |
| Other values (15) | 69234 | 11.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5555614 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 674870 | |
| e | 454080 | 8.2% |
| t | 412656 | 7.4% |
| s | 399702 | 7.2% |
| l | 372025 | 6.7% |
| a | 366974 | 6.6% |
| h | 357871 | 6.4% |
| n | 346801 | 6.2% |
| d | 269766 | 4.9% |
| i | 237466 | 4.3% |
| Other values (41) | 1663403 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5555614 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 674870 | |
| e | 454080 | 8.2% |
| t | 412656 | 7.4% |
| s | 399702 | 7.2% |
| l | 372025 | 6.7% |
| a | 366974 | 6.6% |
| h | 357871 | 6.4% |
| n | 346801 | 6.2% |
| d | 269766 | 4.9% |
| i | 237466 | 4.3% |
| Other values (41) | 1663403 |
specificEpithet
Text
Missing 
| Distinct | 5168 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 13122 |
| Missing (%) | 2.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 22 |
| Mean length | 8.884525609 |
| Min length | 3 |
Unique
| Unique | 760 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | bicarinata |
|---|---|
| 2nd row | montanus |
| 3rd row | enhydris |
| 4th row | mutilata |
| 5th row | richardii |
| Value | Count | Frequency (%) |
| cinereus | 75774 | 13.2% |
| glutinosus | 13098 | 2.3% |
| fuscus | 10921 | 1.9% |
| montanus | 10396 | 1.8% |
| jordani | 7140 | 1.2% |
| metcalfi | 6940 | 1.2% |
| cylindraceus | 6103 | 1.1% |
| carolinensis | 5850 | 1.0% |
| teyahalee | 5559 | 1.0% |
| septentrionalis | 4873 | 0.8% |
| Other values (5117) | 427892 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 546156 | |
| s | 515888 | |
| e | 491179 | |
| a | 489969 | |
| r | 404981 | 8.0% |
| u | 401041 | 7.9% |
| n | 359623 | 7.1% |
| c | 308944 | 6.1% |
| t | 280546 | 5.5% |
| o | 262530 | 5.2% |
| Other values (20) | 1012909 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5063007 | |
| Other Punctuation | 6731 | 0.1% |
| Space Separator | 3467 | 0.1% |
| Dash Punctuation | 561 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 546156 | |
| s | 515888 | |
| e | 491179 | |
| a | 489969 | |
| r | 404981 | |
| u | 401041 | |
| n | 359623 | 7.1% |
| c | 308944 | 6.1% |
| t | 280546 | 5.5% |
| o | 262530 | 5.2% |
| Other values (16) | 1002150 |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 6710 | |
| / | 21 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 3467 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 561 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5063007 | |
| Common | 10759 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 546156 | |
| s | 515888 | |
| e | 491179 | |
| a | 489969 | |
| r | 404981 | |
| u | 401041 | |
| n | 359623 | 7.1% |
| c | 308944 | 6.1% |
| t | 280546 | 5.5% |
| o | 262530 | 5.2% |
| Other values (16) | 1002150 |
Common
| Value | Count | Frequency (%) |
| " | 6710 | |
| 3467 | ||
| - | 561 | 5.2% |
| / | 21 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5073766 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 546156 | |
| s | 515888 | |
| e | 491179 | |
| a | 489969 | |
| r | 404981 | 8.0% |
| u | 401041 | 7.9% |
| n | 359623 | 7.1% |
| c | 308944 | 6.1% |
| t | 280546 | 5.5% |
| o | 262530 | 5.2% |
| Other values (20) | 1012909 |
Missing 
| Distinct | 1460 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 556206 |
| Missing (%) | 95.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 9.076299339 |
| Min length | 3 |
Unique
| Unique | 314 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | occidentalis |
|---|---|
| 2nd row | curta |
| 3rd row | consobrinus |
| 4th row | trinidadensis |
| 5th row | ignigularis |
| Value | Count | Frequency (%) |
| viridescens | 1460 | 5.2% |
| blanchardi | 1211 | 4.3% |
| fasciata | 1043 | 3.7% |
| elegans | 911 | 3.2% |
| undulatus | 640 | 2.3% |
| ordinatus | 395 | 1.4% |
| stejnegeri | 390 | 1.4% |
| louisianensis | 365 | 1.3% |
| dorsalis | 343 | 1.2% |
| fuscus | 318 | 1.1% |
| Other values (1442) | 21119 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 29397 | |
| a | 29037 | |
| s | 26503 | |
| e | 19772 | 7.8% |
| n | 17844 | 7.0% |
| r | 16955 | 6.7% |
| u | 15678 | 6.2% |
| l | 14637 | 5.8% |
| t | 13777 | 5.4% |
| o | 13517 | 5.3% |
| Other values (17) | 56974 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 253891 | |
| Space Separator | 200 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 29397 | |
| a | 29037 | |
| s | 26503 | |
| e | 19772 | 7.8% |
| n | 17844 | 7.0% |
| r | 16955 | 6.7% |
| u | 15678 | 6.2% |
| l | 14637 | 5.8% |
| t | 13777 | 5.4% |
| o | 13517 | 5.3% |
| Other values (16) | 56774 |
Space Separator
| Value | Count | Frequency (%) |
| 200 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 253891 | |
| Common | 200 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 29397 | |
| a | 29037 | |
| s | 26503 | |
| e | 19772 | 7.8% |
| n | 17844 | 7.0% |
| r | 16955 | 6.7% |
| u | 15678 | 6.2% |
| l | 14637 | 5.8% |
| t | 13777 | 5.4% |
| o | 13517 | 5.3% |
| Other values (16) | 56774 |
Common
| Value | Count | Frequency (%) |
| 200 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 254091 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 29397 | |
| a | 29037 | |
| s | 26503 | |
| e | 19772 | 7.8% |
| n | 17844 | 7.0% |
| r | 16955 | 6.7% |
| u | 15678 | 6.2% |
| l | 14637 | 5.8% |
| t | 13777 | 5.4% |
| o | 13517 | 5.3% |
| Other values (17) | 56974 |
taxonRank
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 556206 |
| Missing (%) | 95.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | subspecies |
|---|---|
| 2nd row | subspecies |
| 3rd row | subspecies |
| 4th row | subspecies |
| 5th row | subspecies |
| Value | Count | Frequency (%) |
| subspecies | 27995 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 83985 | |
| e | 55990 | |
| u | 27995 | 10.0% |
| b | 27995 | 10.0% |
| p | 27995 | 10.0% |
| c | 27995 | 10.0% |
| i | 27995 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 279950 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 83985 | |
| e | 55990 | |
| u | 27995 | 10.0% |
| b | 27995 | 10.0% |
| p | 27995 | 10.0% |
| c | 27995 | 10.0% |
| i | 27995 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 279950 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 83985 | |
| e | 55990 | |
| u | 27995 | 10.0% |
| b | 27995 | 10.0% |
| p | 27995 | 10.0% |
| c | 27995 | 10.0% |
| i | 27995 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 279950 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 83985 | |
| e | 55990 | |
| u | 27995 | 10.0% |
| b | 27995 | 10.0% |
| p | 27995 | 10.0% |
| c | 27995 | 10.0% |
| i | 27995 | 10.0% |